Reverse engineering the dengue virus (under development construction)

Overview

Reverse engineering the dengue virus (under development 🚧 )

What is dengue?

Dengue is a viral infection transmitted to humans through the bite of infected mosquitoes. The primary vectors that transmit the disease are Aedes aegypti mosquitoes and, to a lesser extent, Ae. albopictus. The virus responsible for causing dengue, is called dengue virus (DENV). There are four DENV serotypes and it is possible to be infected four times. While many DENV infections produce only mild illness, DENV can cause an acute flu-like illness. Occasionally this develops into a potentially lethal complication, called severe dengue. There is no specific treatment for dengue/severe dengue. Early detection of disease progression associated with severe dengue, and access to proper medical care lowers fatality rates of severe dengue to below 1%. Dengue is found in tropical and sub-tropical climates worldwide, mostly in urban and semi-urban areas. The global incidence of dengue has grown dramatically with about half of the world's population now at risk. Although an estimated 100-400 million infections occur each year, over 80% are generally mild and asymptomatic. - World Health Organization

What do I hope to achieve?

  • Discover patterns

  • Behaviour in terms of evolution for predictions

  • To gauge the similarities, similar characteristics compared to other viruses

The essence of how this works?

  • Each codon (triplet of nitrogenous base) is transcribed into an RNA sequence and is then translated into an amino acid
  • Specific triplets have specific amino acids
  • These translated amino acids perform specific functions ranging from protein synthesis to playing a role in immune response
  • By manipulating the nitrogen bases (AGCT) the code will be introducing mutations in the genetic code which will change the amino acid sequence and thus change the proteins involved in cellular or viral processes
  • The genome sequence I got of U.S. Department of Health & Human Services is RNA, no transcription?????
  • The genome sequence I got from US DOHHS is the DNA sequence, and they used a reverse transcriptase to convert RNA to cDNA - likely!

Code:

See here: notebook

🧬 Biology background info

The big ideas:

  • DNA and RNA are nucleic acids, that is, they are polymers of repeating monomers called nucleotides. Each nucleotide consists of a pentose sugar covalently bonded to a phosphate group and a nitrogenous base.
  • RNA is a single-stranded nucleic acid, it consists of a long chain of nucleotides covalently bonded together by phosphodiester bonds. In RNA the pentose sugar is ribose and the bases are A (adenine), U (uracil), C (cytosine) and G (guanine).
  • DNA is a double-stranded nucleic acid with a double-helix structure. The two strands are also made of long chains of nucleotides covalently bonded by phosphodiester bonds, however, the nucleotides of DNA are made of deoxyribose sugar and the nitrogenous bases A, T (thymine), C and G. The two strands are linked together by hydrogen bonds formed between complementary base pairs on opposite strands. A forms 2 hydrogen bonds with T and C forms 3 hydrogen bonds with G. They always pair up in this same combination. Note that the strands of DNA are antiparallel, they run alongside each other but in opposite orientation.
  • Protein synthesis is the process by which proteins are made in a cell from genetic information. DNA is too large to move out of the nucleus through the nuclear pores, so it transcribes the base sequence of a gene onto an RNA molecule called messenger RNA (or mRNA). Note that the mRNA molecule is made of the complementary bases to the DNA molecule, where U is complementary to A instead of T (eg If a section of DNA has AATC then the mRNA has UUAG). The mRNA carries the transcribed gene onto a ribosome in the cytoplasm of the cell. In the ribosome, another type of RNA called tRNA (transfer RNA) groups bases of the mRNA molecule in triplets called codons. Each group of 3 bases (or codon) codes for one amino acid. As a gene has many bases the tRNA will code for many amino acids which will bond together by peptide bonds to form a large polypeptide (or protein). Each gene codes for specific polypeptides which then fold into proteins and have diverse functions.
The fine details
  • RNA and DNA are both nucleic acids, meaning they are made of repeating subunits called nucleotides. These nucleotides are made of a pentose sugar covalently bonded to a phosphate group and a nitrogenous base. RNA is a single stranded molecule meaning it has one long chain of nucleotides bonded by covalent bonds and has bases A, U, C and G. Also RNA has ribose as the pentose sugar
  • mRNA is used in transcription where the genes of the DNA molecule are replicated onto an RNA molecule (but changing the base T in DNA for U in RNA) which then travels through the cytoplasm of the cell. In the ribosomes the tRNA basically gets groups of 3 bases where each three bases code for one amino acid and so a long chain of amino acids (protein) is made from one gene.
  • DNA is double stranded, has deoxyribose sugar and has a double helix shape. In DNA the 2 strands are held together by hydrogen bonds between the complementary bases on the 2 strands (A from one strand pairs with T from another strand and C from one strand pairs with G from another strand). They always pair up in this way. 2 hydrogen bonds are formed between A and T and 3 hydrogen bonds are formed between C and G and that’s what basically holds the double helix together.

Remember: T in DNA and U in RNA

  1. So you have 4 bases
  2. A and T pair up (In each strand) Of DNA
  3. C and G pair up (In each strand) Of DNA
  4. But in RNA U pairs up with A instead of T
  5. Genome sequence in link is DNA code which is a template - Its the DNA sequence, and they used a reverse transcriptase to convert RNA to cDNA
  6. The two strands of DNA separate. And mRNA bases attach to one of the DNA strands which is opposite to the one it wants to replicate. So if you wanna replicate A then mRNA will attach to T (so the mRNA base is T). They keep doing that until the desired code is replicated and they have a strand of RNA which goes to the ribosome and in ribosome. 3 bases are read at the same time. Those are the triplet codons. And an amino acid matching those is brought to them. Multiple amino acids together are bound by peptide bonds and that makes a protein

Codon table to see the code and see what amino acid that codes for: (DNA)

Note:

  • Multiple amino acids together are bound by peptide bonds and that makes a protein

Basically:

  • DNA is the template that stays inside the nucleus
  • And RNA is the single strand that travels out of the nucleus into the ribosome to code for proteins
  • RNA is a copy of DNA but a single strand
  • Essentially DNA codes for RNA and RNA codes for proteins. But instead of T as a base RNA has U
  • Codon table - multiple triplet codons code for 1 amino acid (eg AAT and AAC both code for leucine) to decrease the likelihood of mutations cos even if a base gets replaced (eg T by C) the same amino acid will be synthesized
  • There MANY types of RNA , theres mRNA, tRNA, snRNA , rRNA etc etc. but, when you’re picturing a strand similar to the dna one you r about mRNA. With mRNA you need to know that basically it derives from the actual dna strand with all the information , but there’s a little complication here. DNA consists of two strands, the coding strand with a 5’-3’ direction which actually contains all the information of the genome as it is, and then its complementary strand, the non-coding strand with a 3-5 direction. Now, the mrna is created FROM the non coding dna strand, that’s why it has the SAME bases (except T, it’s replaced by U) , SAME direction with the coding strand. ( as they are both complementary to the non coding strand ). After all, that’s why they called it ‘coding strand’ of DNA. It’s because it has the same base sequence with the mRNA molecule, which is the one that travels to the ribosomes and helps with protein synthesis etc etc. DNA is ‘transcribed’ into mRNA with a process called ‘transcription’. And badically what happens during this process is some enzymes line up nitrogenous bases of RNA ( adenine, guanine, cytosine and uracyl -remember in place of thymine) opposite the complementary bases of the non-coding dna strand ( the complementary strand of mRNA) . A-T and G-C are complementary. The mRNA is single stranded.
  • Gene sequence of the DNA, there are some regions between the sequences of the actual genes that are not going to be translated to proteins afterwards that are responsible for functions of the DNA , not for protein synthesis. After mRNA is transcribed, it also contains those regions that need to be cut off during a process called mRNA splicing. Enzymes inside the nucleus of the cell cut off those parts that ‘do not talk about proteins’ from the mRNA molecule, ( the introns ) and leave the exons, which are the actual genes. (The non-translating regions are the introns.)
  • The mRNA molecule leaves the cell, travels to the ribosomes and then the translation begins.
  • Each amino acid refers to a codon ( let me explain )
  • A codon is a sequence of 3 bases ( for example AUG- which codes for the amino acid methionin)
  • A gene on the mRNA might have 50 codons or 50 codons ( thus 50x3=150 bases ) . The mrna binds to the ribosome and another molecule , the tRNA , reads the codons, and brings the amino acid needed, and thus the polypeptide chain ( the protein) is created
  • What tf stop means, UAG, UGA and UAA are ‘stop codons’ which code for no amino acid but they ‘inform’ the ribosomes when the polypeptide chain is done and the translation is complete

The DNA is basically untouched ( not to worry about it,only unzips for transcription) . It stays there only opens up to be replicated but that’s it. DNA to RNA is transcription because then translation is when RNA codes for amino acids. You don’t make DNA out of RNA cos the DNA always stays in the nucleus and it’s been there since the cell formed

More technically:

  1. In transcription, the DNA sequence of a gene is "rewritten" in RNA. In eukaryotes, the RNA must go through additional processing steps to become a messenger RNA, or mRNA.
  2. In translation, the sequence of nucleotides in the mRNA is "translated" into a sequence of amino acids in a polypeptide (protein chain).

image

Codon table

image

(That is RNA, is what ultimately codes for the amino acid)

Our progress 🌊

Our todo list

Unanswered Questions

Tests

None yet

Precautions and/or solutions

NOT MEDICAL ADVICE. I AM NOT A DOCTOR.

Resources to learn more about genomes, DNA, RNA, transcription and more:

Websites

Books (dengue related, bioinformatics, vaccines, genome studies, creation of DNA molecules (synthesis), textbooks) and courses:

Owner
kjain
Kernel dev, machine learning practitioner. Makes commits way too inconsistently 🚩
kjain
Python script to decode the EU Covid-19 vaccine certificate

vacdec Python script to decode the EU Covid-19 vaccine certificate This script takes an image with a QR code of a vaccine certificate as the parameter

Hanno Böck 244 Nov 30, 2022
An api, written in Python, for Investopedia's paper trading stock simulator.

investopedia-trading-api An API, written in Python, for Investopedia's paper trading stock simulator. Pull requests welcome. This library is now Pytho

Kirk Thaker 178 Jan 06, 2023
Listen to the radio station from your favorite broadcast

Latest news Listen to the radio station from your favorite broadcast MyCroft Radio Skill for testing and copy at docker skill About Play regional radi

1 Dec 22, 2021
Program that automates the bump of the Disboard Bot. Done 100% in Python with PyAutoGUI library

Auto-Discord-Bump Program that automates the bump of the Disboard Bot done 100% in python with PyAutoGUI How to configue You will need 3 things before

Mateus 1 Dec 19, 2021
Automatically detect changes made to the official Telegram sites.

🕷 Telegram Web Crawler This project is developed to automatically detect changes made to the official Telegram sites. This is necessary for anticipat

Il'ya 115 Dec 31, 2022
A simple google translator telegram bot version 2

Translator-Bot-V2 A simple google translator telegram bot version 2 Made with Python3 (C) @FayasNoushad Copyright permission under MIT License License

Fayas Noushad 15 Oct 21, 2022
Wechat based auto reply with pyautogui

Python-微信 自动回复 练手~ 一直想做个给微信发个消息,就可以跑Python程序,并将结果发送给我的东西,之前看了 B站@不高兴就喝水 的视频,终于有了灵感~ 使用的是模拟点击方案,请求期间是不能操作了。 库 pyautogui 用于模拟鼠标键盘操作和定位操作位置 pyperclip 剪贴板

Vito Song 1 Oct 22, 2022
Discord bot to display private leaderboards for Advent of Code.

Advent Of Code Discord Bot Discord bot for displaying Advent of Code private leardboards, as well as custom leaderboards where participants can set th

The Future Gadgets Lab 6 Nov 29, 2022
Scratch2py or S2py is a easy to use, versatile tool to communicate with the Scratch API Based of Scratch2py

Scratch2py Scratch2py or S2py is a easy to use, versatile tool to communicate with the Scratch API Based of Scratch2py Installation Run this command i

2 Jan 13, 2022
Python script to backup/convert your Spotify playlists into the XSPF format.

Python script to backup/convert your Spotify playlists into the XSPF format.

Chris Ovenden 4 Jun 09, 2022
Automatically deploy freqtrade to a remote Docker host and auto update strategies.

Freqtrade Automatically deploy freqtrade to a remote Docker host and auto update strategies. I've been using it to automatically deploy to vultr, but

p-zombie 109 Jan 07, 2023
The best discord.py template with a changeable prefix

Discord.py Bot Template By noma4321#0035 With A Custom Prefix To Every Guild Function Features Has a custom prefix that is changeable for every guild

Noma4321 5 Nov 24, 2022
Discord-Lite - A light weight discord client written in Python, for developers, by developers.

Discord-Lite - A light weight discord client written in Python, for developers, by developers.

Sachit 142 Jan 07, 2023
Building and deploying AWS Lambda Shared Layers

AWS Lambda Shared Layers This repository is hosting the code from the following blog post: AWS Lambda & Shared layers for Python. The goal of this rep

Flaneer 7 Dec 02, 2021
Graviti-python-sdk - Graviti Data Platform Python SDK

Graviti Python SDK Graviti Python SDK is a python library to access Graviti Data

Graviti 13 Dec 15, 2022
Dribble sign up screen built in python and kivy

Dribble sign up screen built in python and kivy contains Dribble icon with icon position and shadow animation.

1 Dec 06, 2021
"Nesse projeto criei uma automação para abrir as tarefas no Jira em massa pegando de uma determinada fila do Zendesk."

automacao-Zendesk "Nesse projeto criei uma automação para abrir as tarefas no Jira em massa pegando de uma determinada fila do Zendesk." en-us "In thi

tokoyamy 1 Dec 20, 2021
Pure Python implementation of the Windows API method IDvdInfo2::GetDiscID.

pydvdid-m Pure Python implementation of the Windows API method IDvdInfo2::GetDiscID. This is a modification of sjwood's pydvdid. The Windows API metho

4 Nov 22, 2022
Discord bot that automatically fills out health screenings

Auto Covid Bot Automatically fill out the NYC DOE health screening form by registering with a discord bot School code can be found on https://schoolse

Cleo 2 Jul 29, 2022
google-resumable-media Apache-2google-resumable-media (🥉28 · ⭐ 27) - Utilities for Google Media Downloads and Resumable.. Apache-2

google-resumable-media Utilities for Google Media Downloads and Resumable Uploads See the docs for examples and usage. Experimental asyncio Support Wh

Google APIs 36 Nov 22, 2022