A repository of materials for the CS-332 NLP tutorials
-
Tutorial 1:
- Introduction
- Corpus
- Regular expression
- Tokenization
-
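As a small illustration of the Tutorial 1 topics (regular expressions and tokenization), here is a minimal sketch of a regex-based tokenizer. The pattern and the `tokenize` helper are assumptions made for this example and are not taken from the tutorial materials; real tokenizers handle many more cases (numbers with decimals, URLs, hyphenated words).

```python
import re

# Hypothetical pattern (an assumption for this sketch):
# a word with an optional internal apostrophe part, or a single punctuation mark.
TOKEN_PATTERN = re.compile(r"\w+(?:'\w+)?|[^\w\s]")


def tokenize(text):
    """Split text into word and punctuation tokens using the pattern above."""
    return TOKEN_PATTERN.findall(text)


tokens = tokenize("Don't panic: NLP is fun!")
print(tokens)  # ["Don't", 'panic', ':', 'NLP', 'is', 'fun', '!']
```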
Tutorial 2:
- Normalization
- Parsing
- Morpheme
- Stemming
- Lemmatization
-
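To make the Tutorial 2 topics (normalization and stemming) more concrete, the sketch below lowercases a word and strips a few common English suffixes. The suffix list, the minimum stem length, and the function names are assumptions chosen for illustration; this is not the Porter stemmer, and it is not lemmatization, which would need a dictionary of base forms.

```python
# Hypothetical suffix list for this sketch, checked in order (longest first).
SUFFIXES = ("ing", "ed", "es", "s")


def normalize(text):
    """Lowercase the text and trim surrounding whitespace."""
    return text.lower().strip()


def naive_stem(word):
    """Strip the first matching suffix, keeping a stem of at least 3 characters."""
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


for raw in ["Walking", "walked", "boxes", "cats", "is", "running"]:
    print(raw, "->", naive_stem(normalize(raw)))
# "running" becomes "runn", which shows why rule-based stems are not always real words.
```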
Tutorial 3:
- What is ML?
- Types of ML algorithms
- Supervised Learning
- Unsupervised Learning
- Evaluation measures
-
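For the evaluation measures listed under Tutorial 3, a minimal sketch of precision, recall, and F1 for a binary classifier follows. The function name, the label encoding (1 as the positive class), and the toy labels are assumptions made for this example; the tutorial may cover other measures as well.

```python
def precision_recall_f1(gold, predicted, positive=1):
    """Compute precision, recall, and F1 for the given positive label."""
    pairs = list(zip(gold, predicted))
    tp = sum(1 for g, p in pairs if g == positive and p == positive)
    fp = sum(1 for g, p in pairs if g != positive and p == positive)
    fn = sum(1 for g, p in pairs if g == positive and p != positive)

    # Guard against division by zero when there are no predictions or no positives.
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return precision, recall, f1


# Toy labels (made up for illustration): 2 true positives, 1 false positive, 1 false negative.
gold = [1, 0, 1, 1, 0]
predicted = [1, 1, 0, 1, 0]
print(precision_recall_f1(gold, predicted))
```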
Tutorial 4:
- Feed-forward neural network
- Language modeling
-
References:
- Jurafsky, D., & Martin, J. H. Speech and Language Processing (2nd and 3rd editions).
- Marcus, M. P., Santorini, B., & Marcinkiewicz, M. A. (1994). Building a large annotated corpus of English: The Penn Treebank. Using Large Corpora, 273.
- http://su.diva-portal.org/smash/record.jsf?pid=diva2%3A686162&dswid=9114