[AI6122] Text Data Management & Processing

Last update: Jan 17, 2022

Overview

[AI6122] Text Data Management & Processing

====== I M P O R T A N T ======

The content in this repository should exclusively be utilized in sharing solutions for projects, communicating ideas for related problems, and references to similar assignments. If you are a student facing an assignment with the same or similar topics, you can use this repository as a reference, while the final report should include the citations of the repository. If you submit an assignment without proper acknowledgment after referring to this repository, you may be considered PLAGIARISM by your instructor, and the author will not pay ANY responsibility for this. Please refer to your teacher's and your school's instructions for the determination of academic integrity.

Moreover, if you are taking the AI6122 course, do not be stupid. You can utilize the materials here as a reference to construct your own assignment and reflect the citation to this repository in the final report. If you copy the code without citing it, you have violated NTU's academic integrity and are involved in plagiarism.

Please refer to the following links for NTU's determination of academic integrity and plagiarism:

https://ts.ntu.edu.sg/sites/intranet/dept/tlpd/ai/Pages/NTU-Academic-Integrity-Policy.aspx

https://ts.ntu.edu.sg/sites/intranet/dept/tlpd/ai/Pages/default.aspx

https://ts.ntu.edu.sg/sites/policyportal/new/Documents/All%20including%20NIE%20staff%20and%20students/Student%20Academic%20Integrity%20Policy.pdf

If you think the professor is easy to fool, think again.
You know who you are.

====== D I S C L A I M E R ======

This repository should only be used for reasonable academic discussions. I, the owner of this repository, never and will never ALLOWING another student to copy this assignment as their own. In such circumstances, I do not violate NTU's statement on academic integrity as of the time this repository is open (18/01/2022). I am not responsible for any future plagiarism using the content of this repository.

====== I N T R O D U C T I O N ======

[AI6122] Text Data Management & Processing is an elective course of Master of Science in Artificial Intelligence Graduate Programme (MSAI), School of Computer Science and Engineering (SCSE), Nanyang Technological University (NTU), Singapore. The repository corresponds to the AI6122 of Semester 1, AY2021-2022, starting from 08/2021. The instructor of this course is Prof. Sun Aixin.

The projects of this course consist of one individual Literature Review, and one group Project. The topic of them are shown below, and we do not have the specific grade of them given by the prof. Since multiple people complete the group work, I do not have the right to disclose the report and others' codes individually so that the relevant parts will be hidden, and the group project only presents part of the code and report finished by myself.

Type	Topic	Grade
Literature Review	Chinese Spelling Check	N.A. / 30.0
Group Project	Data Analysis and Processing	N.A. / 40.0
Quiz	N.A.	N.A. / 30.0

====== A C K N O W L E D G E M E N T ======

All of above projects are designed by Prof. Sun Aixin.

[AI6122] Text Data Management & Processing

Related tags

Overview

[AI6122] Text Data Management & Processing

====== I M P O R T A N T ======

====== D I S C L A I M E R ======

====== I N T R O D U C T I O N ======

====== A C K N O W L E D G E M E N T ======

Owner

HT. Li

Deep learning with dynamic computation graphs in TensorFlow

R interface to fast.ai

SIR model parameter estimation using a novel algorithm for differentiated uniformization.

PyTorch code for EMNLP 2021 paper: Don't be Contradicted with Anything! CI-ToD: Towards Benchmarking Consistency for Task-oriented Dialogue System

ONNX Command-Line Toolbox

Locationinfo - A script helps the user to show network information such as ip address

Gems & Holiday Package Prediction

(ICCV 2021 Oral) Re-distributing Biased Pseudo Labels for Semi-supervised Semantic Segmentation: A Baseline Investigation.

A unified framework to jointly model images, text, and human attention traces.

Lbl2Vec learns jointly embedded label, document and word vectors to retrieve documents with predefined topics from an unlabeled document corpus.

Learning Continuous Image Representation with Local Implicit Image Function

EFENet: Reference-based Video Super-Resolution with Enhanced Flow Estimation

A PyTorch implementation of unsupervised SimCSE

A project for developing transformer-based models for clinical relation extraction

Code for "Long-tailed Distribution Adaptation"

Reading list for research topics in Masked Image Modeling

[CVPRW 2022] Attentions Help CNNs See Better: Attention-based Hybrid Image Quality Assessment Network

Manipulation OpenAI Gym environments to simulate robots at the STARS lab

The repository contains reproducible PyTorch source code of our paper Generative Modeling with Optimal Transport Maps, ICLR 2022.

An efficient 3D semantic segmentation framework for Urban-scale point clouds like SensatUrban, Campus3D, etc.