First steps with Python in Life Sciences

Last update: Jan 08, 2023

This course material is part of the three-day SIB-training course "First Steps with Python in Life Sciences". It is aimed at beginners who want to become familiar with Python syntax, the Python environment, and the most common commands.

This course material provides an interactive introduction to Python and to Jupyter notebooks (a web-based notebook system for creating and sharing computational documents).

prerequisite installation

You can find tips and instructions to ensure you have installed all the required software before starting the course.

course material organization

The course revolves around a series of Jupyter notebooks that take you through the first steps of your Python journey.

Each Jupyter notebook interleaves theory and code examples. We heartily recommend that you execute and play around with these bits of code as you follow along: in programming, perhaps even more than anywhere else, practice makes perfect.

Additionally, each notebook is associated with a number of exercises (often in a separate notebook) of varying difficulty, with associated corrections.

If you are attending this course with a teacher (or if you are just curious), you can take a look at our schedule. In short, lessons 00 to 04 deal with general aspects of the Python language, while notebooks 05 to 08 present some of the most common modules used in data analysis and/or life sciences.

The notebooks/ folder contains each lesson:

00_jupyter_setup
01_python_basics
02_python_structures
03_reading_writing_files
04_modules
05_module_pandas : handle tabular data (data frames) with pandas
06_module_matplotlib : create nice graphics and plots with matplotlib
07_module_biopython : do all kinds of bioinformatics with [biopython](https://biopython.org/)
08_module_numpy_and_scipy : fast numerical computations with numpy + a bit of statistics with scipy.stats
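
To give a rough flavor of the kind of code the first lessons build towards (basics, data structures, reading and writing files), here is a minimal sketch that uses only the standard library. The DNA sequence, the `gc_content` function, and the output file name are illustrative assumptions and are not taken from the course notebooks.

```python
# Illustrative sketch only: the names and data below are hypothetical,
# not taken from the course notebooks.
import tempfile
from collections import Counter
from pathlib import Path


def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases in a DNA sequence."""
    counts = Counter(sequence.upper())
    total = sum(counts.values())
    if total == 0:
        return 0.0
    return (counts["G"] + counts["C"]) / total


# Python basics: a made-up DNA sequence stored in a string variable.
dna = "ATGCGCTA"

# Python structures: a dictionary mapping each base to its count.
base_counts = dict(Counter(dna))

# Reading and writing files: save the result, then read it back.
with tempfile.TemporaryDirectory() as tmp_dir:
    out_file = Path(tmp_dir) / "gc_content.txt"
    out_file.write_text(f"{dna}\t{gc_content(dna):.2f}\n")
    saved_line = out_file.read_text().strip()

print(base_counts)
print(saved_line)
```

Running a snippet like this in a notebook cell and then changing the sequence is a quick way to practice the execute-and-experiment approach recommended above.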

Exercise notebooks:

The data used in the practicals can be found in the notebooks/data/ folder, and solution code can be found in the notebooks/solutions/ folder (NB: micro-exercises do not have a correction).

Releases (October2022)

October2022 (Oct 12, 2022)

Course material for the October 2022 edition of the SIB course "First Steps with Python in Life Sciences"

May2022 (May 12, 2022)

Release for the May2022 edition of the course in Basel

Owner

SIB Swiss Institute of Bioinformatics
