Compartmental epidemic model to assess undocumented infections: applications to SARS-CoV-2 epidemics in Brazil - Datasets and Codes

Overview

Compartmental epidemic model to assess undocumented infections: applications to SARS-CoV-2 epidemics in Brazil - Datasets and Codes

The codes for simulations were written in Fortran and compiled with the Intel Fortran Compiler. Data analysis and figures were done Python 3.10 and the following open source libraries: pandas, matplotlib and seaborn.

In this repository we show codes for simulations and processing data, as well as datasets used.

The preprint is available at https://arxiv.org/abs/2201.03476. The following BibTeX code can be used to cite it:

@misc{costa2022compartmental,
      title={Compartmental epidemic model to assess undocumented infections: applications to SARS-CoV-2 epidemics in Brazil}, 
      author={Guilherme S. Costa and Wesley Cota and Silvio C. Ferreira},
      year={2022},
      eprint={2201.03476},
      archivePrefix={arXiv},
      primaryClass={q-bio.PE}
}

See also Effects of infection fatality ratio and social contact matrices on vaccine prioritization strategies and Outbreak diversity in epidemic waves propagating through distinct geographical scales.

Dictionaries

Municipalities :The files (a) dictES.csv and (b) dictPR.csv yield some information about municipalities of (a) ES (B) PR states. These files have six columns:

  1. ID: numeric key regarding calibration of confirmed cases time series
  2. ibgeID: official code to identify the city
  3. name: name of the city
  4. intermID: official code of intermediate region to which the city belongs
  5. imedID: official code of immediate region to which the city belongs
  6. totPop2019: population of the city estimated in 2019

Immediate and intermediate regions The files (a) dictImed.csv and (b) dictInterm.csv yield some information about (a) Immediate and (b) Intermediate regions of PR and ES. These files have five columns:

  1. ID: numeric key regarding calibration of confirmed cases time series
  2. imedID or \verb|intermID|: official code to identify the region
  3. name: name of the region
  4. state: state to which the region belongs
  5. totPop2019: population of the region estimated in 2019

States The file dictUF.csv yield some information about PR and ES states. These files have five columns:

  1. ID: numeric key regarding calibration of confirmed cases time series
  2. ibgeID: official code to identify the state
  3. name: name of the state
  4. uf: abbreviation of the state's name
  5. totPop2019: population of the state estimated in 2019

Time series

Cases and deaths: The files (a) PR.csv, (b) ES.csv, (c) saopaulo.csv and (d) manaus.csv yield the time series of confirmed cases and deaths since April 1, 2020 for (a) All cities of PR state, (b) All cities of ES state, (c) São Paulo city and (d) Manaus city. These files have seven columns:

  1. date: date
  2. ibgeID: official code to identify the city
  3. newCases: new confirmed cases on that day
  4. newDeaths: new confirmed deaths on that day
  5. city: name of the city
  6. totalCases: accumulated cases
  7. totalDeaths: accumulated deaths

Calibration: Within files (a) imed.zip and (b) state.zip we have the time series of accumulated cases and fatality ratio, aggregated for different geographical levels. In this, we have two types of files: casesXX.dat (XX refers to the calibrating IDs mentioned before) are accumulated cases while lethXX.dat are the daily fatalities).

Calibration Code

The file calibra.f90 is a program written in Fortran that executes the calibration algorithm described on Methods section of the main paper $1000$ times with different epidemiological parameters. This program has four inputs: the time series of accumulated cases and fatality, the initial date for calibration and the population of the region (state, city, etc). Besides that, this program has two output files: epiQuantities.dat and hiddenCompart.dat. The first has seven columns:

  1. Days from the initial time
  2. Calibrated confirmed cases
  3. Reference cases
  4. Effective reproductive number
  5. Fraction of susceptible population
  6. Underreporting coefficient
  7. Sample

On hiddenCompart.dat, we have time series for some compartments in the model: from left to right S, E, A, I, CA + CI, R + RI + RA + D and sample number.

Python scripts and figures

Calculation of underreporting coefficient: the file underreporting.ipynb is a I-python script that calculates the underreporting coefficient starting from a time series of confirmed cases and deaths. At the end, it exhibits a graphic showing the evolution of this coefficient.

Template for figures The majority of figures in this work were generated with matplotlib and seaborn packages of Python 3.7. File format_covid19br.mplstyle contains the template (font family and sizes) for generating those figures and graphics.

Pytorch implementation of the paper: "SAPNet: Segmentation-Aware Progressive Network for Perceptual Contrastive Image Deraining"

SAPNet This repository contains the official Pytorch implementation of the paper: "SAPNet: Segmentation-Aware Progressive Network for Perceptual Contr

11 Oct 17, 2022
Deep Learning Based Fasion Recommendation System for Ecommerce

Project Name: Fasion Recommendation System for Ecommerce A Deep learning based streamlit web app which can recommened you various types of fasion prod

BAPPY AHMED 13 Dec 13, 2022
The official implementation of the CVPR 2021 paper FAPIS: a Few-shot Anchor-free Part-based Instance Segmenter

FAPIS The official implementation of the CVPR 2021 paper FAPIS: a Few-shot Anchor-free Part-based Instance Segmenter Introduction This repo is primari

Khoi Nguyen 8 Dec 11, 2022
Secure Distributed Training at Scale

Secure Distributed Training at Scale This repository contains the implementation of experiments from the paper "Secure Distributed Training at Scale"

Yandex Research 9 Jul 11, 2022
For encoding a text longer than 512 tokens, for example 800. Set max_pos to 800 during both preprocessing and training.

LongScientificFormer For encoding a text longer than 512 tokens, for example 800. Set max_pos to 800 during both preprocessing and training. Some code

Athar Sefid 6 Nov 02, 2022
PyMatting: A Python Library for Alpha Matting

Given an input image and a hand-drawn trimap (top row), alpha matting estimates the alpha channel of a foreground object which can then be composed onto a different background (bottom row).

PyMatting 1.4k Dec 30, 2022
SE3 Pose Interp - Interpolate camera pose or trajectory in SE3, pose interpolation, trajectory interpolation

SE3 Pose Interpolation Pose estimated from SLAM system are always discrete, and

Ran Cheng 4 Dec 15, 2022
Automated Hyperparameter Optimization Competition

QQ浏览器2021AI算法大赛 - 自动超参数优化竞赛 ACM CIKM 2021 AnalyticCup 在信息流推荐业务场景中普遍存在模型或策略效果依赖于“超参数”的问题,而“超参数"的设定往往依赖人工经验调参,不仅效率低下维护成本高,而且难以实现更优效果。因此,本次赛题以超参数优化为主题,从真

20 Dec 09, 2021
A large-scale database for graph representation learning

A large-scale database for graph representation learning

Scott Freitas 29 Nov 25, 2022
Pytorch implementation of "A simple neural network module for relational reasoning" (Relational Networks)

Pytorch implementation of Relational Networks - A simple neural network module for relational reasoning Implemented & tested on Sort-of-CLEVR task. So

Kim Heecheol 800 Dec 05, 2022
The Rich Get Richer: Disparate Impact of Semi-Supervised Learning

The Rich Get Richer: Disparate Impact of Semi-Supervised Learning Preprocess file of the dataset used in implicit sub-populations: (Demographic groups

<a href=[email protected]"> 4 Oct 14, 2022
A Joint Video and Image Encoder for End-to-End Retrieval

Frozen️ in Time ❄️ ️️️️ ⏳ A Joint Video and Image Encoder for End-to-End Retrieval project page | arXiv | webvid-data Repository containing the code,

225 Dec 25, 2022
Contextualized Perturbation for Textual Adversarial Attack, NAACL 2021

Contextualized Perturbation for Textual Adversarial Attack Introduction This is a PyTorch implementation of Contextualized Perturbation for Textual Ad

cookielee77 30 Jan 01, 2023
Source code for our EMNLP'21 paper 《Raise a Child in Large Language Model: Towards Effective and Generalizable Fine-tuning》

Child-Tuning Source code for EMNLP 2021 Long paper: Raise a Child in Large Language Model: Towards Effective and Generalizable Fine-tuning. 1. Environ

46 Dec 12, 2022
This is a repository with the code for the ACL 2019 paper

The Story of Heads This is the official repo for the following papers: (ACL 2019) Analyzing Multi-Head Self-Attention: Specialized Heads Do the Heavy

231 Nov 15, 2022
Flybirds - BDD-driven natural language automated testing framework, present by Trip Flight

Flybird | English Version 行为驱动开发(Behavior-driven development,缩写BDD),是一种软件过程的思想或者

Ctrip, Inc. 706 Dec 30, 2022
A deep learning network built with TensorFlow and Keras to classify gender and estimate age.

Convolutional Neural Network (CNN). This repository contains a source code of a deep learning network built with TensorFlow and Keras to classify gend

Pawel Dziemiach 1 Dec 19, 2021
Learning to Reach Goals via Iterated Supervised Learning

Vanilla GCSL This repository contains a vanilla implementation of "Learning to Reach Goals via Iterated Supervised Learning" proposed by Dibya Gosh et

Christoph Heindl 4 Aug 10, 2022
Code for the paper Relation Prediction as an Auxiliary Training Objective for Improving Multi-Relational Graph Representations (AKBC 2021).

Relation Prediction as an Auxiliary Training Objective for Knowledge Base Completion This repo provides the code for the paper Relation Prediction as

Facebook Research 85 Jan 02, 2023
Learning to Map Large-scale Sparse Graphs on Memristive Crossbar

Release of AutoGMap:Learning to Map Large-scale Sparse Graphs on Memristive Crossbar For reproduction of our searched model, the Ubuntu OS is recommen

2 Aug 23, 2022