Code repository for the paper Computer Vision User Entity Behavior Analytics

Last update: Aug 20, 2022

Related tags

Deep Learning CVUEBA

Overview

Computer Vision User Entity Behavior Analytics

Code repository for "Computer Vision User Entity Behavior Analytics"

Code Description

dataset.csv

As discussed in the manuscript, CVUEBA was designed to be utilized in production. Thus, as an extra layer of security, we keep the features used as well as the feature extraction module proprietary.

We observed that one can obtain similar performance on the CERT Insider Threat dataset using a combination of features introduced by various publications in concert with the features we introduce in the main manuscript.

dataset.csv is a CSV file containing the extracted features for various users for various days in the CERT Insider Threat dataset. For space reasons, we publish a small segment of the original dataset here. Reported instances were chosen by randomly selecting from the set of encoded images used to evaluate CVUEBA and storing unique behavior instances corresponding to the channels of these images.

We did not wish for all of the code to be proprietary, and thus felt this was an acceptable compromise.

split_dataset.py

Splits dataset into train, test, and validation sets.

sae_hopt.py & SAE.hyperopt

This script is used for hyperparameter search for the SAE model using the HyperOpt module. Results of tuning are stored within SAE.hyperopt.

SAE.py

Defines the SAE model. Optimal hyperparameters are determined as shown in the script sae_hopt.py.

generate_images.py

Trains the SAE model using optimal parameters stored in SAE.hyperopt if a trained model is not present. Uses this model to generate color image encodings of behavior.

extract_non_dynamic.py and nondynamic.pkl

CVUEBA uses non-dynamic information to improve model precision. This script extracts the information from the CERT Insider Threat dataset and stores it within nondynamic.pkl.

To execute this script you would need to download the CERT Insider Threat dataset. For demo purposes, we provide a pre-extracted pickle file in the repo.

prep_data_model.py

This is a custom data loader that uses the image directory name and nondynamic.pkl to pull the information to be passed into the CVUEBA model.

CVUEBA.py

Loads train and test set data, builds CVUEBA model, trains and saves model, and reports evaluation metrics.

How To Use

We provide a requirements.txt file that lists all dependencies required to run the demo.

The script run.sh is provided to execute all the various python scripts in order to split data, generate images, and evaluate CVUEBA.

Code repository for the paper Computer Vision User Entity Behavior Analytics

Related tags

Overview

Computer Vision User Entity Behavior Analytics

Code Description

dataset.csv

split_dataset.py

sae_hopt.py & SAE.hyperopt

SAE.py

generate_images.py

extract_non_dynamic.py and nondynamic.pkl

prep_data_model.py

CVUEBA.py

How To Use

Owner

Sameer Khanna

Towhee is a flexible machine learning framework currently focused on computing deep learning embeddings over unstructured data.

[EMNLP 2021] Distantly-Supervised Named Entity Recognition with Noise-Robust Learning and Language Model Augmented Self-Training

sktime companion package for deep learning based on TensorFlow

Unified Instance and Knowledge Alignment Pretraining for Aspect-based Sentiment Analysis

A full pipeline AutoML tool for tabular data

Hierarchical Cross-modal Talking Face Generation with Dynamic Pixel-wise Loss （ATVGnet）

Unofficial PyTorch implementation of Fastformer based on paper "Fastformer: Additive Attention Can Be All You Need"."

With this package, you can generate mixed-integer linear programming (MIP) models of trained artificial neural networks (ANNs) using the rectified linear unit (ReLU) activation function

Code for our paper at ECCV 2020: Post-Training Piecewise Linear Quantization for Deep Neural Networks

Unsupervised Image Generation with Infinite Generative Adversarial Networks

Robust Partial Matching for Person Search in the Wild

Code for generating the figures in the paper "Capacity of Group-invariant Linear Readouts from Equivariant Representations: How Many Objects can be Linearly Classified Under All Possible Views?"

Image-to-Image Translation in PyTorch

PyTorch implementation of saliency map-aided GAN for Auto-demosaic+denosing

TakeInfoatNistforICS - Take Information in NIST NVD for ICS

[CVPR 2022 Oral] TubeDETR: Spatio-Temporal Video Grounding with Transformers

Script that receives an Image (original) and a set of images to be used as "pixels" in reconstruction of the Original image using the set of images as "pixels"

Implementation of the paper NAST: Non-Autoregressive Spatial-Temporal Transformer for Time Series Forecasting.

Code for the CVPR2022 paper "Frequency-driven Imperceptible Adversarial Attack on Semantic Similarity"

Real-time VIBE: Frame by Frame Inference of VIBE (Video Inference for Human Body Pose and Shape Estimation)