An official repository for tutorials of Probabilistic Modelling and Reasoning (2021/2022) - a University of Edinburgh master's course.

Last update: Dec 06, 2022

Overview

PMR computer tutorials on HMMs (2021-2022)

This is a repository for computer tutorials of Probabilistic Modelling and Reasoning (2021/2022) - a University of Edinburgh master's course.

The tutorial consists of three parts:

HMM basics
HMM inference
HMM learning (with coding exercises), this tutorial contains a few code snippets for you to fill in
- A notebook with complete code is also provided here

Environment setup

Before you start with the tutorials you will first need to setup the environment on your preferred machine. The tutorials will use simple examples, hence any machine will do.

Setup on your machine

You'll need to open terminal on your machine and then follow the below instructions

Install git (linux, macOS, windows) to access the repository if you don't have it already
Clone the git repository on your machine by running git clone in the terminal (you can find a guide here)
Once you've cloned the repository, step into the directory by entering cd pmr2022-hmm into the terminal
If you don’t already have it also install miniconda (linux, macOS, windows), which will allow you to manage all python dependencies per project
You can now create the pmr conda environment by typing conda env create -f environment.yml. This step may take a while to complete since it has to download large binaries and you should better be connected to a good internet connection.

Starting the Jupyter server

Once you have the environment prepared you can start your jupyter notebook

Activate the conda environment with conda activate pmr
Now you will be able to start your jupyter server by typing jupyter notebook, which will start the server and open a browser to access the tutorial notebook. Click tutorial link in the browser window. You can stop the server by pressing Ctrl+c (or Cmd+c) in the terminal when you are done with it.

Google Colab

You can also access and run the notebooks on Google Colab directly via this link http://colab.research.google.com/github/vsimkus/pmr2022-hmm. More details can be found at https://colab.research.google.com/github/googlecolab/colabtools/blob/master/notebooks/colab-github-demo.ipynb#scrollTo=WzIRIt9d2huC.

Note that the Colab notebook environment should already include all the required dependencies, however, the versions may differ, hence the results may differ slightly from the provided solutions but that should not be a problem for this tutorial.

Attributions

The tutorials in this repository were authored by Yao Fu and Shangmin Guo in discussion with Michael Gutmann, and edited by Vaidotas Šimkus.

An official repository for tutorials of Probabilistic Modelling and Reasoning (2021/2022) - a University of Edinburgh master's course.

Related tags

Overview

PMR computer tutorials on HMMs (2021-2022)

Environment setup

Setup on your machine

Starting the Jupyter server

Google Colab

Attributions

Owner

Vaidotas Šimkus

This repository contains the code for EMNLP-2021 paper "Word-Level Coreference Resolution"

Beta Distribution Guided Aspect-aware Graph for Aspect Category Sentiment Analysis with Affective Knowledge. Proceedings of EMNLP 2021

🤗 Transformers: State-of-the-art Natural Language Processing for Pytorch, TensorFlow, and JAX.

Python library for processing Chinese text

Speech Recognition for Uyghur using Speech transformer

Ongoing research training transformer language models at scale, including: BERT & GPT-2

⚡ Automatically decrypt encryptions without knowing the key or cipher, decode encodings, and crack hashes ⚡

Just Another Telegram Ai Chat Bot Written In Python With Pyrogram.

DANeS is an open-source E-newspaper dataset by collaboration between DATASET JSC (dataset.vn) and AIV Group (aivgroup.vn)

Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.

AI-powered literature discovery and review engine for medical/scientific papers

Data loaders and abstractions for text and NLP

IndoBERTweet is the first large-scale pretrained model for Indonesian Twitter. Published at EMNLP 2021 (main conference)

☀️ Measuring the accuracy of BBC weather forecasts in Honolulu, USA

Applying "Load What You Need: Smaller Versions of Multilingual BERT" to LaBSE

Disfl-QA: A Benchmark Dataset for Understanding Disfluencies in Question Answering

Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.

Script to download some free japanese lessons in portuguse from NHK

Automatic privilege escalation for misconfigured capabilities, sudo and suid binaries

中文問句產生器；使用台達電閱讀理解資料集(DRCD)