BERT-SST2-Prod

Reproduction process of BERT on SST2 dataset

安装说明

下载代码库

git clone https://github.com/JunnYu/BERT-SST2-Prod

进入文件夹，安装requirements

pip install -r requirements.txt

安装PaddlePaddle与PyTorch

# CPU版本的PaddlePaddle
pip install paddlepaddle==2.2.0 -i https://mirror.baidu.com/pypi/simple
# 如果希望安装GPU版本的PaddlePaddle，可以使用下面的命令
# pip install paddlepaddle-gpu==2.2.0.post112 -f https://www.paddlepaddle.org.cn/whl/linux/mkl/avx/stable.html
# 安装PyTorch
pip install torch==1.10.0+cu113 torchvision==0.11.1+cu113 torchaudio==0.10.0+cu113 -f https://download.pytorch.org/whl/cu113/torch_stable.html

注意: 本项目依赖于paddlepaddle-2.2.0版本，安装时需要注意。

验证PaddlePaddle是否安装成功

运行python，输入下面的命令。

import paddle
paddle.utils.run_check()
print(paddle.__version__)

如果输出下面的内容，则说明PaddlePaddle安装成功。

PaddlePaddle is installed successfully! Let's start deep learning with PaddlePaddle now.
2.2.0

验证PyTorch是否安装成功

运行python，输入下面的命令，如果可以正常输出，则说明torch安装成功。

import torch
print(torch.__version__)
# 如果安装的是cpu版本，可以按照下面的命令确认torch是否安装成功
# 期望输出为 tensor([1.])
print(torch.Tensor([1.0]))
# 如果安装的是gpu版本，可以按照下面的命令确认torch是否安装成功
# 期望输出为 tensor([1.], device='cuda:0')
print(torch.Tensor([1.0]).cuda())

Reproduction process of BERT on SST2 dataset

Related tags

Overview

BERT-SST2-Prod

安装说明

Owner

yujun

A demo of chinese asr

Estimation of the CEFR complexity score of a given word, sentence or text.

Python-zhuyin - An open source Python library that provides a unified interface for converting between Chinese pinyin and Zhuyin (bopomofo)

DaCy: The State of the Art Danish NLP pipeline using SpaCy

This repository contains the code, models and datasets discussed in our paper "Few-Shot Question Answering by Pretraining Span Selection"

Recognition of 38 speech commands in russian. Based on Yandex Cup 2021 ML Challenge: ASR

Diaformer: Automatic Diagnosis via Symptoms Sequence Generation

GNES enables large-scale index and semantic search for text-to-text, image-to-image, video-to-video and any-to-any content form

🌐 Translation microservice powered by AI

Datasets of Automatic Keyphrase Extraction

sangha, pronounced "suhng-guh", is a social networking, booking platform where students and teachers can share their practice.

Multiple implementations for abstractive text summurization , using google colab

FactSumm: Factual Consistency Scorer for Abstractive Summarization

Turn clang-tidy warnings and fixes to comments in your pull request

Generating new names based on trends in data using GPT2 (Transformer network)

Skipgram Negative Sampling in PyTorch

This repository contains the official release of the model "BanglaBERT" and associated downstream finetuning code and datasets introduced in the paper titled "BanglaBERT: Combating Embedding Barrier in Multilingual Models for Low-Resource Language Understanding".

Code for ACL 2021 main conference paper "Conversations are not Flat: Modeling the Intrinsic Information Flow between Dialogue Utterances".

Library for Russian imprecise rhymes generation