Deep learning algorithms for muon momentum estimation in the CMS Trigger System

The Compact Muon Solenoid (CMS) is a general-purpose detector at the Large Hadron Collider (LHC). During a run, it generates about 40 TB data per second. Since It is not feasible to readout and store such a vast amount of data, so a trigger system selects and stores only interesting events or events likely to reveal new physics phenomena. The goal of this project is to benchmark the muon momentum estimation performance of Fully Connected Neural Networks (FCNN), Convolutional Neural Networks (CNN), and Graph Neural Networks (GNN), on the prompt and displaced muon samples detected by CSC stations at CMS to aid trigger system's transverse momentum (pT) muon estimation.

About

In the project FCNNs, CNNs, and GNNs are trained and evaluated on the prompt muon samples (two versions of same samples with different sampling approaches), and displaced muon samples generated by Monte Carlo simulation. The other details are -

Target Variables: Three types of predictions are benchmarked with each type of algorithm.

Target	Loss
1/Transverse_momentum (1/pT)	Mean Square Error (MSE)
Transverse Momentum (pT)	$y\_true * (\frac{y\_true - y\_predicted}{y\_true})^{2}$
4 class classification (0-10 GeV, 10-30 GeV, 30-100 GeV, >100 GeV)	Focal Loss

Validation Scheme: 10 fold out-of-fold predictions (i.e. dataset is splitted into 10 small batches, out of them 8 are used for training, 1 as validation dataset and 1 as holdout. This holdout is changed 10 times to give the final scores.)
Metrices Tracked:
- MAE - Mean Absolute Error at a given transverse momentum (pT).
- MAE/pT - Ratio of Mean Absolute Error to transverse momentum at a given transverse momentum.
- Acurracy - At a given pT, muon samples can be divided into two classes, one muons with pT more than this given and another class of muons with pT less than this. So, Acurracy at a given pT is the accuracy for these two classes.
- F1-score (of class pT>x GeV) - At a given pT, this is the f1-score of the class of muons with pT more than this given pT.
- F1-score (of class pT - At a given pT, this is the f1-score of the class of muons with pT less than this given pT.

  Preprocessing: Standard scaling of input coordinates

 
How to use

Make sure that all the libraries mentioned in requirements.txt are installed
Clone the repo


https://github.com/lastnameis-borah/CMS_moun_transverse_momentum_estimation.git


Change current directory to the cloned directory and execute main.py with the required arguments


python main.py --path='/kaggle/input/cmsnewsamples/new-smaples.csv' \
                --dataset='prompt_new'\
                --predict='pT'\
                --model='FCNN'\
                --epochs=50 \
                --batch_size=512\
                --folds="0,1,2,3,4,5,6,7,8,9" \
                --results='/kaggle/working/results'
 Note: Give absolute paths as argument
 Arguments

path - path of the csv having the coordinates of generated muon samples
dataset - specify the samples that you are using (i.e. prompt_new, prompt_old, or displaced)
predict - target variable (i.e. pT, 1/pT, or pT_classes)
model - architecture to use (i.e. FCNN, CNN, or GNN)
epochs - max number of epochs to train, if score converges than due to early-stopping training may stop earlier
batchsize - number of samples in a batch
folds - a string containing the info on which folds one wants the result
results - path of the directory to save the results
 
Results
 
Regressing 1/pT



Metric
Prompt Muons Samples-1
Prompt Muons Samples-2
Displaced Muons Samples




MAE/pT





MAE





Accuracy





F1-score (pT>x)





F1-score (pT





 
Regressing pT



Metric
Prompt Muons Samples-1
Prompt Muons Samples-2
Displaced Muons Samples




MAE/pT





MAE





Accuracy





F1-score (pT>x)





F1-score (pT





 
Four class classification

Prompt Muons Samples-1




Model
0-10 GeV
10-30 GeV
30-100 GeV
>100GeV




FCNN
0.990
0.970
0.977
0.969


CNN
0.991
0.973
0.980
0.983




Prompt Muons Samples-2




Model
0-10 GeV
10-30 GeV
30-100 GeV
>100GeV




FCNN
0.990
0.975
0.981
0.958


CNN
0.991
0.976
0.983
0.983




Displaced Muons Samples




Model
0-10 GeV
10-30 GeV
30-100 GeV
>100GeV




FCNN
0.944
0.898
0.910
0.839


CNN
0.958
0.907
0.932
0.910

Deep learning algorithms for muon momentum estimation in the CMS Trigger System

Related tags

Overview

Deep learning algorithms for muon momentum estimation in the CMS Trigger System

About

How to use

Results

Regressing 1/pT

Regressing pT

Four class classification

Owner

anuragB

Replication Code for "Self-Supervised Bug Detection and Repair" NeurIPS 2021

This is the official repository of XVFI (eXtreme Video Frame Interpolation)

Rule-based Customer Segmentation

Code related to the manuscript "Averting A Crisis In Simulation-Based Inference"

Image-retrieval-baseline - MUGE Multimodal Retrieval Baseline

KaziText is a tool for modelling common human errors.

Implementation of Continuous Sparsification, a method for pruning and ticket search in deep networks

CLIP-GEN: Language-Free Training of a Text-to-Image Generator with CLIP

Official PyTorch Implementation of "AgentFormer: Agent-Aware Transformers for Socio-Temporal Multi-Agent Forecasting".

MPViT:Multi-Path Vision Transformer for Dense Prediction

Alignment Attention Fusion framework for Few-Shot Object Detection

BaseCls BaseCls 是一个基于 MegEngine 的预训练模型库，帮助大家挑选或训练出更适合自己科研或者业务的模型结构

Code for our ACL 2021 paper "One2Set: Generating Diverse Keyphrases as a Set"

This repository accompanies the ACM TOIS paper "What can I cook with these ingredients?" - Understanding cooking-related information needs in conversational search

This is the code for CVPR 2021 oral paper: Jigsaw Clustering for Unsupervised Visual Representation Learning

Using LSTM write Tang poetry

Continual reinforcement learning baselines: experiment specifications, implementation of existing methods, and common metrics. Easily extensible to new methods.

Tensorflow implementation of Swin Transformer model.

Alias-Free Generative Adversarial Networks (StyleGAN3) Official PyTorch implementation

rastrainer is a QGIS plugin to training remote sensing semantic segmentation model based on PaddlePaddle.