Collection of scripts to pinpoint obfuscated code

Last update: Nov 26, 2022

Related tags

Text Data & NLP obfuscation_detection

Overview

Obfuscation Detection (v1.0)

Author: Tim Blazytko

Automatically detect control-flow flattening and other state machines

Description:

Scripts and binaries to automatically detect control-flow flattening and other state machines in binaries.

Implementation is based on Binary Ninja. Check out the following blog post for more information:

Automated Detection of Control-flow Flattening

Usage

$ ./detect_flattening.py samples/finspy 
Function 0x401602 has a flattening score of 0.9473684210526315.
Function 0x4017c0 has a flattening score of 0.9981378026070763.
Function 0x405150 has a flattening score of 0.9166666666666666.
Function 0x405270 has a flattening score of 0.9166666666666666.
Function 0x405370 has a flattening score of 0.9984544049459042.
Function 0x4097a0 has a flattening score of 0.9992378048780488.
Function 0x412c70 has a flattening score of 0.9629629629629629.
Function 0x412df0 has a flattening score of 0.9629629629629629.
Function 0x412f70 has a flattening score of 0.9927007299270073.
Function 0x4138e0 has a flattening score of 0.9629629629629629.

Note

The password for the zipped malware samples is "infected". To unpack, use the following command line:

$ unzip -P infected samples.zip

Contact

For more information, contact @mr_phrazer.

A collection of models for image - text generation in ACM MM 2021.

Bi-directional Image and Text Generation UMT-BITG (image & text generator) Unifying Multimodal Transformer for Bi-directional Image and Text Generatio

63 Oct 30, 2022

An open collection of annotated voices in Japanese language

声庭 (Koniwa): オープンな日本語音声とアノテーションのコレクション Koniwa (声庭): An open collection of annotated voices in Japanese language 概要 Koniwa(声庭)は利用・修正・再配布が自由でオープンな音声とアノテ

32 Dec 14, 2022

ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab

AliceMind AliceMind: ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab This repository provides pre-trained encode

922 Dec 10, 2021

Code examples for my Write Better Python Code series on YouTube.

Write Better Python Code This repository contains the code examples used in my Write Better Python Code series published on YouTube: https:/

858 Dec 29, 2022

Code to use Augmented Shapiro Wilks Stopping, as well as code for the paper "Statistically Signifigant Stopping of Neural Network Training"

This codebase is being actively maintained, please create and issue if you have issues using it Basics All data files are included under losses and ea

32 Nov 9, 2021

Code for the Python code smells video on the ArjanCodes channel.

7 Python code smells This repository contains the code for the Python code smells video on the ArjanCodes channel (watch the video here). The example

55 Dec 29, 2022

Code for CodeT5: a new code-aware pre-trained encoder-decoder model.

CodeT5: Identifier-aware Unified Pre-trained Encoder-Decoder Models for Code Understanding and Generation This is the official PyTorch implementation

564 Jan 8, 2023

Galois is an auto code completer for code editors (or any text editor) based on OpenAI GPT-2.

Galois is an auto code completer for code editors (or any text editor) based on OpenAI GPT-2. It is trained (finetuned) on a curated list of approximately 45K Python (~470MB) files gathered from the Github. Currently, it just works properly on Python but not bad at other languages (thanks to GPT-2's power).

91 Sep 23, 2022

Code-autocomplete, a code completion plugin for Python

Code AutoComplete code-autocomplete, a code completion plugin for Python.

13 Jan 7, 2023

Comments

plugin?

Are you interested in a PR to add a plugin.json so this could be used either in headless mode on the command-line or via the UI inside BN itself which would let it be installable via the plugin manager?

opened by psifertex 2
Replace Counter.total() for users with python < 3.10

I'm running Binary Ninja on windows 10 and it's got Python 3.9.2, which means the Counter.total() function in calc_uncommon_instruction_sequences_score() doesn't work. I've replaced this with sum(counter.values()) which should do the same thing

opened by samrussell 1

Collection of scripts to pinpoint obfuscated code

Related tags

Overview

Obfuscation Detection (v1.0)

Description:

Usage

Note

Contact

You might also like...

A collection of models for image - text generation in ACM MM 2021.

An open collection of annotated voices in Japanese language

ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab

Code examples for my Write Better Python Code series on YouTube.

Code to use Augmented Shapiro Wilks Stopping, as well as code for the paper "Statistically Signifigant Stopping of Neural Network Training"

Code for the Python code smells video on the ArjanCodes channel.

Code for CodeT5: a new code-aware pre-trained encoder-decoder model.

Galois is an auto code completer for code editors (or any text editor) based on OpenAI GPT-2.

Code-autocomplete, a code completion plugin for Python

Comments

plugin?

Replace Counter.total() for users with python < 3.10

Releases(v1.4)

v1.4(Feb 23, 2022)

v1.3(Feb 14, 2022)

v1.2(Aug 14, 2021)

v1.1(Aug 10, 2021)

v1.0(Mar 5, 2021)

Owner

Tim Blazytko

Translate - a PyTorch Language Library

Natural Language Processing Best Practices & Examples

Simple python code to fix your combo list by removing any text after a separator or removing duplicate combos

GCRC: A Gaokao Chinese Reading Comprehension dataset for interpretable Evaluation

Deep Learning for Natural Language Processing - Lectures 2021

Honor's thesis project analyzing whether the GPT-2 model can more effectively generate free-verse or structured poetry.

Code to reprudece NeurIPS paper: Accelerated Sparse Neural Training: A Provable and Efficient Method to Find N:M Transposable Masks

This is a project built for FALLABOUT2021 event under SRMMIC, This project deals with NLP poetry generation.

Unsupervised Language Model Pre-training for French

Text-to-Speech for Belarusian language

Based on 125GB of data leaked from Twitch, you can see their monthly revenues from 2019-2021

pyMorfologik MorfologikpyMorfologik - Python binding for Morfologik.

使用Mask LM预训练任务来预训练Bert模型。训练垂直领域语料的模型表征，提升下游任务的表现。

📔️ Generate a text-based journal from a template file.

Research code for the paper "Fine-tuning wav2vec2 for speaker recognition"

Textpipe: clean and extract metadata from text

2021海华AI挑战赛·中文阅读理解·技术组·第三名

NLP, before and after spaCy

A list of NLP(Natural Language Processing) tutorials

Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.