An open source library for deep learning end-to-end dialog systems and chatbots.

Last update: Dec 30, 2022

Overview

DeepPavlov is an open-source conversational AI library built on TensorFlow, Keras and PyTorch.

DeepPavlov is designed for

development of production ready chat-bots and complex conversational systems,
research in the area of NLP and, particularly, of dialog systems.

Quick Links

Demo demo.deeppavlov.ai
Documentation docs.deeppavlov.ai
- Model List docs:features/
- Contribution Guide docs:contribution_guide/
Issues github/issues/
Forum forum.deeppavlov.ai
Blogs medium.com/deeppavlov
Tutorials examples/ and extended colab tutorials
Docker Hub hub.docker.com/u/deeppavlov/
- Docker Images Documentation docs:docker-images/

Please leave us your feedback on how we can improve the DeepPavlov framework.

Models

Named Entity Recognition | Slot filling

Intent/Sentence Classification | Question Answering over Text (SQuAD)

Knowledge Base Question Answering

Morphological tagging | Syntactic parsing

Automatic Spelling Correction | ELMo training and fine-tuning

Speech recognition and synthesis (ASR and TTS) based on NVIDIA NeMo

Entity Linking | Multitask BERT

Skills

Goal(Task)-oriented Bot | Seq2seq Goal-Oriented bot

Open Domain Questions Answering | eCommerce Bot

Frequently Asked Questions Answering | Pattern Matching

Embeddings

BERT embeddings for the Russian, Polish, Bulgarian, Czech, and informal English

ELMo embeddings for the Russian language

FastText embeddings for the Russian language

Auto ML

Tuning Models with Evolutionary Algorithm

Integrations

REST API | Socket API | Yandex Alice

Telegram | Microsoft Bot Framework

Amazon Alexa | Amazon AWS

Installation

We support Linux and Windows platforms, Python 3.6 and Python 3.7
- Python 3.5 is not supported!
- installation for Windows requires Git(for example, git) and Visual Studio 2015/2017 with C++ build tools installed!

Create and activate a virtual environment:

Linux

python -m venv env
source ./env/bin/activate

Windows

python -m venv env
.\env\Scripts\activate.bat

Install the package inside the environment:
```
pip install deeppavlov
```

QuickStart

There is a bunch of great pre-trained NLP models in DeepPavlov. Each model is determined by its config file.

List of models is available on the doc page in the deeppavlov.configs (Python):

from deeppavlov import configs

When you're decided on the model (+ config file), there are two ways to train, evaluate and infer it:

via Command line interface (CLI) and
via Python.

GPU requirements

To run supported DeepPavlov models on GPU you should have CUDA 10.0 installed on your host machine and TensorFlow with GPU support (tensorflow-gpu) installed in your python environment. Current supported TensorFlow version is 1.15.2. Run

pip install tensorflow-gpu==1.15.2

before installing model's package requirements to install supported tensorflow-gpu version.

Before making choice of an interface, install model's package requirements (CLI):

python -m deeppavlov install <config_path>

where <config_path> is path to the chosen model's config file (e.g. deeppavlov/configs/ner/slotfill_dstc2.json) or just name without .json extension (e.g. slotfill_dstc2)

Command line interface (CLI)

To get predictions from a model interactively through CLI, run

python -m deeppavlov interact <config_path> [-d]

-d downloads required data -- pretrained model files and embeddings (optional).

You can train it in the same simple way:

python -m deeppavlov train <config_path> [-d]

Dataset will be downloaded regardless of whether there was -d flag or not.

To train on your own data you need to modify dataset reader path in the train config doc. The data format is specified in the corresponding model doc page.

There are even more actions you can perform with configs:

python -m deeppavlov <action> <config_path> [-d]

<action> can be
- download to download model's data (same as -d),
- train to train the model on the data specified in the config file,
- evaluate to calculate metrics on the same dataset,
- interact to interact via CLI,
- riseapi to run a REST API server (see doc),
- telegram to run as a Telegram bot (see doc),
- msbot to run a Miscrosoft Bot Framework server (see doc),
- predict to get prediction for samples from stdin or from <file_path> if -f <file_path> is specified.
<config_path> specifies path (or name) of model's config file
-d downloads required data

Python

To get predictions from a model interactively through Python, run

from deeppavlov import build_model

model = build_model(<config_path>, download=True)

# get predictions for 'input_text1', 'input_text2'
model(['input_text1', 'input_text2'])

where download=True downloads required data from web -- pretrained model files and embeddings (optional),
<config_path> is path to the chosen model's config file (e.g. "deeppavlov/configs/ner/ner_ontonotes_bert_mult.json") or deeppavlov.configs attribute (e.g. deeppavlov.configs.ner.ner_ontonotes_bert_mult without quotation marks).

You can train it in the same simple way:

from deeppavlov import train_model 

model = train_model(<config_path>, download=True)

download=True downloads pretrained model, therefore the pretrained model will be, first, loaded and then train (optional).

Dataset will be downloaded regardless of whether there was -d flag or not.

To train on your own data you need to modify dataset reader path in the train config doc. The data format is specified in the corresponding model doc page.

You can also calculate metrics on the dataset specified in your config file:

from deeppavlov import evaluate_model 

model = evaluate_model(<config_path>, download=True)

There are also available integrations with various messengers, see Telegram Bot doc page and others in the Integrations section for more info.

Breaking Changes

Breaking changes in version 0.7.0

in dialog logger config file dialog_logger_config.json agent_name parameter was renamed to logger_name, the default value was changed
Agent, Skill, eCommerce Bot and Pattern Matching classes were moved to deeppavlov.deprecated
AIML Skill, RASA Skill, Yandex Alice, Amazon Alexa, Microsoft Bot Framework and Telegram integration interfaces were changed
/start and /help Telegram messages were moved from models_info.json to server_config.json
risesocket request and response format was changed
riseapi and risesocket model-specific properties parametrization was changed

Breaking changes in version 0.6.0

REST API:
- all models default endpoints were renamed to /model
- by default model arguments names are taken from chainer.in configuration parameter instead of pre-set names from a settings file
- swagger api endpoint moved from /apidocs to /docs
when using "max_proba": true in a proba2labels component for classification, it will return single label for every batch element instead of a list. One can set "top_n": 1 to get batches of single item lists as before

Breaking changes in version 0.5.0

dependencies have to be reinstalled for most pipeline configurations
models depending on tensorflow require CUDA 10.0 to run on GPU instead of CUDA 9.0
scikit-learn models have to be redownloaded or retrained

Breaking changes in version 0.4.0!

default target variable name for neural evolution was changed from MODELS_PATH to MODEL_PATH.

Breaking changes in version 0.3.0!

component option fit_on_batch in configuration files was removed and replaced with adaptive usage of the fit_on parameter.

Breaking changes in version 0.2.0!

utils module was moved from repository root in to deeppavlov module
ms_bot_framework_utils,server_utils, telegram utils modules was renamed to ms_bot_framework, server and telegram correspondingly
rename metric functions exact_match to squad_v2_em and squad_f1 to squad_v2_f1
replace dashes in configs name with underscores

Breaking changes in version 0.1.0!

As of version 0.1.0 all models, embeddings and other downloaded data for provided configurations are by default downloaded to the .deeppavlov directory in current user's home directory. This can be changed on per-model basis by modifying a ROOT_PATH variable or related fields one by one in model's configuration file.
In configuration files, for all features/models, dataset readers and iterators "name" and "class" fields are combined into the "class_name" field.
deeppavlov.core.commands.infer.build_model_from_config() was renamed to build_model and can be imported from the deeppavlov module directly.
The way arguments are passed to metrics functions during training and evaluation was changed and documented.

License

DeepPavlov is Apache 2.0 - licensed.

The Team

DeepPavlov is built and maintained by Neural Networks and Deep Learning Lab at MIPT.

Comments

Regarding Spelling Error model
Thanks for amazing toolkit :) Can you please share your views on below questions

How does correct_prior & incorrect_prior calculation done in Error model ?

How do we incorporate "count" with incorrect-correct pair e.g. if training data is in form of (intended_word, observed_word, count).

Is there any other way we can combine LM score & EM score in LM beam search method ?

Thanks a lot !!
opened by smilenrhyme 26
Error while trying to get the probablities of the predicted entities using ontonotes_bert ner model
Deeppavlov version: 0.12.1 Python version: 3.7.7 "return_probas"set to true in ner_ontonotes_bert config json Command: python -m deeppavlov interact F:\miniconda3\envs\ute_query_params_service\Lib\site-packages\deeppavlov\configs\ner\ner_ontonotes_bert.js Command string: London is in England. Full Traceback: [nltk_data] Downloading package punkt to [nltk_data] C:\Users\User\AppData\Roaming\nltk_data... [nltk_data] Package punkt is already up-to-date! [nltk_data] Downloading package stopwords to [nltk_data] C:\Users\User\AppData\Roaming\nltk_data... [nltk_data] Package stopwords is already up-to-date! [nltk_data] Downloading package perluniprops to [nltk_data] C:\Users\User\AppData\Roaming\nltk_data... [nltk_data] Package perluniprops is already up-to-date! [nltk_data] Downloading package nonbreaking_prefixes to [nltk_data] C:\Users\User\AppData\Roaming\nltk_data... [nltk_data] Package nonbreaking_prefixes is already up-to-date! WARNING:tensorflow:From f:\miniconda3\envs\ute_query_params_service\lib\site-packages\bert_dp\tokenization.py:125: The name tf.gfile.GFile is deprecated. Please use tf.io.gfile.GFile instead.

2020-11-30 21:37:31.122 INFO in 'deeppavlov.core.data.simple_vocab'['simple_vocab'] at line 115: [loading vocabulary from C:\Users\User.deeppavlov\models\ner_ontonotes_bert\tag.dict] WARNING:tensorflow:From f:\miniconda3\envs\ute_query_params_service\lib\site-packages\deeppavlov\core\models\tf_model.py:37: The name tf.Session is deprecated. Please use tf.compat.v1.Session instead.

WARNING:tensorflow:From f:\miniconda3\envs\ute_query_params_service\lib\site-packages\deeppavlov\core\models\tf_model.py:222: The name tf.placeholder is deprecated. Please use tf.compat.v1.placeholder instead.

WARNING:tensorflow:From f:\miniconda3\envs\ute_query_params_service\lib\site-packages\deeppavlov\core\models\tf_model.py:222: The name tf.train.Optimizer is deprecated. Please use tf.compat.v1.train.Optimizer instead.

WARNING:tensorflow:From f:\miniconda3\envs\ute_query_params_service\lib\site-packages\deeppavlov\core\models\tf_model.py:193: The name tf.train.AdamOptimizer is deprecated. Please use tf.compat.v1.train.AdamOptimizer instead.

WARNING:tensorflow:From f:\miniconda3\envs\ute_query_params_service\lib\site-packages\deeppavlov\models\bert\bert_sequence_tagger.py:236: The name tf.ConfigProto is deprecated. Please use tf.compat.v1.ConfigProto instead.

WARNING:tensorflow:From f:\miniconda3\envs\ute_query_params_service\lib\site-packages\deeppavlov\models\bert\bert_sequence_tagger.py:314: The name tf.placeholder_with_default is deprecated. Please use tf.compat.v1.placeholder_with_default instead.

WARNING:tensorflow:From f:\miniconda3\envs\ute_query_params_service\lib\site-packages\bert_dp\modeling.py:178: The name tf.variable_scope is deprecated. Please use tf.compat.v1.variable_scope instead.

WARNING:tensorflow:From f:\miniconda3\envs\ute_query_params_service\lib\site-packages\bert_dp\modeling.py:418: The name tf.get_variable is deprecated. Please use tf.compat.v1.get_variable instead.

WARNING:tensorflow:From f:\miniconda3\envs\ute_query_params_service\lib\site-packages\bert_dp\modeling.py:499: The name tf.assert_less_equal is deprecated. Please use tf.compat.v1.assert_less_equal instead.

WARNING:tensorflow: The TensorFlow contrib module will not be included in TensorFlow 2.0. For more information, please see:

https://github.com/tensorflow/community/blob/master/rfcs/20180907-contrib-sunset.md

https://github.com/tensorflow/addons

https://github.com/tensorflow/io (for I/O related ops) If you depend on functionality not listed there, please file an issue.

WARNING:tensorflow:From f:\miniconda3\envs\ute_query_params_service\lib\site-packages\bert_dp\modeling.py:366: calling dropout (from tensorflow.python.ops.nn_ops) with keep_prob is deprecated and will be removed in a future version. Instructions for updating: Please use rate instead of keep_prob. Rate should be set to rate = 1 - keep_prob. WARNING:tensorflow:From f:\miniconda3\envs\ute_query_params_service\lib\site-packages\bert_dp\modeling.py:680: dense (from tensorflow.python.layers.core) is deprecated and will be removed in a future version. Instructions for updating: Use keras.layers.Dense instead. WARNING:tensorflow:From f:\miniconda3\envs\ute_query_params_service\lib\site-packages\tensorflow_core\python\layers\core.py:187: Layer.apply (from tensorflow.python.keras.engine.base_layer) is deprecated and will be removed in a future version. Instructions for updating: Please use layer.__call__ method instead. WARNING:tensorflow:From f:\miniconda3\envs\ute_query_params_service\lib\site-packages\bert_dp\modeling.py:283: The name tf.erf is deprecated. Please use tf.math.erf instead.

WARNING:tensorflow:Variable *= will be deprecated. Use var.assign(var * other) if you want assignment to the variable value or x = x * y if you want a new python Tensor object. WARNING:tensorflow:From f:\miniconda3\envs\ute_query_params_service\lib\site-packages\deeppavlov\models\bert\bert_sequence_tagger.py:75: where (from tensorflow.python.ops.array_ops) is deprecated and will be removed in a future version. Instructions for updating: Use tf.where in 2.0, which has the same broadcast rule as np.where WARNING:tensorflow:From f:\miniconda3\envs\ute_query_params_service\lib\site-packages\tensorflow_core\contrib\crf\python\ops\crf.py:213: dynamic_rnn (from tensorflow.python.ops.rnn) is deprecated and will be removed in a future version. Instructions for updating: Please use keras.layers.RNN(cell), which is equivalent to this API WARNING:tensorflow:From f:\miniconda3\envs\ute_query_params_service\lib\site-packages\deeppavlov\core\models\tf_model.py:234: The name tf.train.AdadeltaOptimizer is deprecated. Please use tf.compat.v1.train.AdadeltaOptimizer instead.

WARNING:tensorflow:From f:\miniconda3\envs\ute_query_params_service\lib\site-packages\deeppavlov\core\models\tf_model.py:131: The name tf.get_collection is deprecated. Please use tf.compat.v1.get_collection instead.

WARNING:tensorflow:From f:\miniconda3\envs\ute_query_params_service\lib\site-packages\deeppavlov\core\models\tf_model.py:131: The name tf.GraphKeys is deprecated. Please use tf.compat.v1.GraphKeys instead.

WARNING:tensorflow:From f:\miniconda3\envs\ute_query_params_service\lib\site-packages\deeppavlov\core\models\tf_model.py:94: The name tf.global_variables is deprecated. Please use tf.compat.v1.global_variables instead.

WARNING:tensorflow:From f:\miniconda3\envs\ute_query_params_service\lib\site-packages\tensorflow_core\python\training\moving_averages.py:433: Variable.initialized_value (from tensorflow.python.ops.variables) is deprecated and will be removed in a future version. Instructions for updating: Use Variable.read_value. Variables in 2.X are initialized automatically both in eager and graph (inside tf.defun) contexts. WARNING:tensorflow:From f:\miniconda3\envs\ute_query_params_service\lib\site-packages\deeppavlov\models\bert\bert_sequence_tagger.py:671: The name tf.assign is deprecated. Please use tf.compat.v1.assign instead.

WARNING:tensorflow:From f:\miniconda3\envs\ute_query_params_service\lib\site-packages\deeppavlov\models\bert\bert_sequence_tagger.py:244: The name tf.global_variables_initializer is deprecated. Please use tf.compat.v1.global_variables_initializer instead.

WARNING:tensorflow:From f:\miniconda3\envs\ute_query_params_service\lib\site-packages\deeppavlov\models\bert\bert_sequence_tagger.py:249: checkpoint_exists (from tensorflow.python.training.checkpoint_management) is deprecated and will be removed in a future version. Instructions for updating: Use standard file APIs to check for files with this prefix. 2020-11-30 21:40:57.982 INFO in 'deeppavlov.core.models.tf_model'['tf_model'] at line 51: [loading model from C:\Users\User.deeppavlov\models\ner_ontonotes_bert\model] WARNING:tensorflow:From f:\miniconda3\envs\ute_query_params_service\lib\site-packages\deeppavlov\core\models\tf_model.py:54: The name tf.train.Saver is deprecated. Please use tf.compat.v1.train.Saver instead.

x::London is in England. Traceback (most recent call last): File "f:\miniconda3\envs\ute_query_params_service\lib\runpy.py", line 193, in _run_module_as_main "main", mod_spec) File "f:\miniconda3\envs\ute_query_params_service\lib\runpy.py", line 85, in run_code exec(code, run_globals) File "f:\miniconda3\envs\ute_query_params_service\lib\site-packages\deeppavlov_main.py", line 4, in main() File "f:\miniconda3\envs\ute_query_params_service\lib\site-packages\deeppavlov\deep.py", line 89, in main interact_model(pipeline_config_path) File "f:\miniconda3\envs\ute_query_params_service\lib\site-packages\deeppavlov\core\commands\infer.py", line 89, in interact_model pred = model(*args) File "f:\miniconda3\envs\ute_query_params_service\lib\site-packages\deeppavlov\core\common\chainer.py", line 207, in call return self._compute(*args, param_names=self.in_x, pipe=self.pipe, targets=self.out_params) File "f:\miniconda3\envs\ute_query_params_service\lib\site-packages\deeppavlov\core\common\chainer.py", line 230, in _compute res = component.call(*x) File "f:\miniconda3\envs\ute_query_params_service\lib\site-packages\deeppavlov\core\data\simple_vocab.py", line 92, in call looked_up_batch = [self(sample, is_top=False) for sample in batch] File "f:\miniconda3\envs\ute_query_params_service\lib\site-packages\deeppavlov\core\data\simple_vocab.py", line 92, in looked_up_batch = [self(sample, is_top=False) for sample in batch] File "f:\miniconda3\envs\ute_query_params_service\lib\site-packages\deeppavlov\core\data\simple_vocab.py", line 92, in call looked_up_batch = [self(sample, is_top=False) for sample in batch] File "f:\miniconda3\envs\ute_query_params_service\lib\site-packages\deeppavlov\core\data\simple_vocab.py", line 92, in looked_up_batch = [self(sample, is_top=False) for sample in batch] File "f:\miniconda3\envs\ute_query_params_service\lib\site-packages\deeppavlov\core\data\simple_vocab.py", line 92, in call looked_up_batch = [self(sample, is_top=False) for sample in batch] File "f:\miniconda3\envs\ute_query_params_service\lib\site-packages\deeppavlov\core\data\simple_vocab.py", line 92, in looked_up_batch = [self(sample, is_top=False) for sample in batch] File "f:\miniconda3\envs\ute_query_params_service\lib\site-packages\deeppavlov\core\data\simple_vocab.py", line 94, in call return self[batch] File "f:\miniconda3\envs\ute_query_params_service\lib\site-packages\deeppavlov\core\data\simple_vocab.py", line 161, in getitem raise NotImplementedError("not implemented for type {}".format(type(key))) NotImplementedError: not implemented for type <class 'numpy.float32'>

I would like to get the probablities of predicted entities. Is return_probas is the way to go? Please help. Thanks in advance.

Ravishankar
opened by ravishpankar 20
Data set creation routine for gobot DSTC 2 format
Hi,

I want to create data set creation routine for gobot DSTC 2 format. I know that that there is an on going refactoring of the codebase for the Goal-oriented bot (gobot).

Also, there is a new DSTC 8 challenge and Alexa Prize socialbot which is to be open sourced.

So I want to ask if this feature would be needed or is it duplication of work?

Ideally, I want to pull the routine to the deeppavlov repo, so I need some guidance/advice before jumping into the implementation.

Things I want to clarify:

Is this routine needed to be developed? Or is it already underway and it would be duplication of work?

What format would be best (DSTC 2 json, DSTC 8, etc)?

I want to create CLI with python, is it good?

Anything else you think might be appropriate.
feature request
opened by Eugen2525 17
NER fine-tuninng using pretrained using ner_ontonotes_bert_mult

I am trying to use Bert for NER recognition of standard and some custom entities. For instance, I have the following entities: 'financial_instrument', 'amount', 'percent', 'hashtag', 'exchange', 'number', 'sector', 'period', 'location', 'media_type', 'analyst', 'ticker', 'person', 'price_movement', 'rating_agency', 'product', 'amount_price_target', 'financial_topic', 'publication', 'company', 'event' and some of them I can map to the existing ones in BIO format, such as: EVENT, PERCENT, NUMBER, GPE, PERIOD, NORG, etc. However, some of my entities are new and cannot be mapped. I was wondering if I can use the pretrained ner_ontonotes_bert_mult and just add my specific entities to fine-tune the model? Is this possible and could you provide to me sample code?

Tnx

opened by igormis 17
deeppavlov.core.common.errors.ConfigError: 'Given fasttext model does NOT match fasttext model used previously to train loaded model'

I goy the following error

Traceback (most recent call last): File "deep.py", line 63, in main() File "deep.py", line 55, in main interact_model_by_telegram(pipeline_config_path, token) File "/home/mdixie/.conda/envs/deeppavlov/lib/python3.6/site-packages/deeppavlov-0.0.1-py3.6.egg/telegram_utils/telegram_ui.py", line 57, in interact_model_by_telegram model = build_model_from_config(config) File "/home/mdixie/.conda/envs/deeppavlov/lib/python3.6/site-packages/deeppavlov-0.0.1-py3.6.egg/deeppavlov/core/commands/infer.py", line 34, in build_model_from_config model = from_params(REGISTRY[model_name], model_config, vocabs=vocabs, mode=mode) File "/home/mdixie/.conda/envs/deeppavlov/lib/python3.6/site-packages/deeppavlov-0.0.1-py3.6.egg/deeppavlov/core/common/params.py", line 49, in from_params mode=kwargs['mode']) File "/home/mdixie/.conda/envs/deeppavlov/lib/python3.6/site-packages/deeppavlov-0.0.1-py3.6.egg/deeppavlov/core/common/params.py", line 52, in from_params model = cls(**dict(config_params, **kwargs)) File "/home/mdixie/.conda/envs/deeppavlov/lib/python3.6/site-packages/deeppavlov-0.0.1-py3.6.egg/deeppavlov/core/models/tf_backend.py", line 47, in call obj.init(*args, **kwargs) File "/home/mdixie/.conda/envs/deeppavlov/lib/python3.6/site-packages/deeppavlov-0.0.1-py3.6.egg/deeppavlov/core/models/tf_backend.py", line 28, in _wrapped return func(*args, **kwargs) File "/home/mdixie/.conda/envs/deeppavlov/lib/python3.6/site-packages/deeppavlov-0.0.1-py3.6.egg/deeppavlov/models/classifiers/intents/intent_model.py", line 138, in init "Given fasttext model does NOT match fasttext model used previously to train loaded model") deeppavlov.core.common.errors.ConfigError: 'Given fasttext model does NOT match fasttext model used previously to train loaded model'

opened by dixiematt8 16
Bert for classification

Здравствуйте! Я новичок в deep learning, хотела бы претренированную модель руберт использовать для своих данных. Задача обычная классификация русских текстов - 5 классов. Работаю на гугл колаб. Есть ли у вас туториалы пошагово как применять bert от deeppavlov для собаственных данных? как менять конфиг файл? Для примера взяла rusentiment_bert.json, но не могу разобраться как привести свои данные в тот формат, который требуется для MODELS_PATH. Спасибо!

opened by Aygera 15
building go-bot in russian

Hi! I want to build a go-bot using DeepPavlov in russian. The task of gobot is to output phone number of requested employee by his name, surname, fathers name. I plan to use tutorial03 as a reference. And the main idea is using instead of DSTC2 data set a new one, which i gonna generate in DSTC2 format. Has the described aproach a right to exist?
help wanted

opened by vitalyuf 15

Upgrading Tensorflow to 1.15.4 in tf.txt

What problem are we trying to solve?:

There are many security risks around TensorFlow version 1.15.2. These issues have been fixed in version 1.15.4 and newer versions. Newer versions have also been optimized better than 1.15.2.

How can we solve it?:

Upgrading the TensorFlow to 1.15.4 in the deeppavlov>>requirements>>tf.txt from tensorflow==1.15.2 to tensorflow==1.15.4.

Are there other issues that block this solution?:

Missing out on the optimization and security enhancements.

Any important aspect to consider?:

Making sure that upgrading the TensorFlow to 1.15.4 causes no issue in syntaxes and yields the same results as 1.15.2.

enhancement

opened by Rajmehta123 12

Library does't see GPU
Hi everyone, thanks for your library! I use several BERT models, but I can't train them using GPU. I describe all process:

I install Deeppavlov package into docker container

I install tensorflow-gpu: pip install tensorflow-gpu==1.14.0

I install model’s package requirements and download model

I move docker container to another machine with acсess to GPU. This machine has CUDA and cudnn. But when I train model, it uses CPU. I try to check access to GPU using this command: tf.test_is_gpu_avalaible. It returns me False( May be there is a mistake in this sequence of actions?
opened by ostreech1997 12
feat: Imdb sentiment dataset reader

This PR implements a dataset reader for the IMDb sentiment classification dataset. It also includes a json configuration for BERT (en, cased) which is mostly the same as the configuration for rusentiment except for the max seq length and batch size (which I set to values such that I don't get out-of-memory on my hardware).

This PR also includes a fix for the sets_accuracy metric which should now correctly work for string labels (i.e. wrap them into sets instead converting them to sets). Also I added reporting of cached files in download_decompress.

opened by sgrechanik-h 12
ODQA inference speed very very slow

Running the default configuration and model on a EC2 p2.xlarge instance (60~GB Ram and Nvidia K80 GPU) and inference for simple questions take 40 seconds to 5 minutes.

Sometimes, no result even after 10 minutes.

opened by shubhank008 12
add fusion in decoder

Файл c кодом модели полностью перекопирован из репозитория FiD, мне показалось что ради одного файла добавлять целый репозиторий не стоит, но мне не очевдно насколько это правильное решение

opened by LogicZMaksimka 0
👩‍💻📞DeepPavlov Community Call #18

Привет, друзья!

Мы рады вернуться в этом месяце с DeepPavlov Community Call на русском языке. На предстоящем вебинаре к нам придет приглашенный гость Борис Галицкий, ассоциированный сотрудник лаборатории Интеллектуальных систем и структурного анализа НИУ ВШЭ, основатель нескольких стартапов в области ИИ, профессор ANECA, а также бывший сотрудник Oracle, представит доклад на тему “Дискурсивный анализ текста для организации диалога”.

Сделать диалог с чат-ботом логичным и интересным — важнейшая задача области Conversational AI. Для этого применяются самые разные подходы, и один из них — дискурсивный анализ текста. Его идея состоит в том, чтобы чат-бот помог пользователю сфокусироваться лишь на каком-либо предложении из всего текста. В дискурсивном дереве текст разбивается на части, связанные логическими отношениями, и чат-бот направляет по ним пользователя, развивая диалог. Например, это могут быть временные (temporal) отношения, когда пользователя наверняка заинтересует, что будет после описанного события или что было до него. На нашем вебинаре Борис Галицкий подробно расскажет о способе управления ходом диалога в чат-боте на основе дискурсивного анализа текста.

DeepPavlov Community Call #11, Русская версия (27 июля, 2022) Мы проведем следующий звонок 27 июля 2022 в 19.00 по Московскому времени (19 MSK). Добавьте напоминание в календарь: http://bit.ly/MonthlyDPCommunityCall2021Ru

Повестка DeepPavlov Community Call #18, Русская версия:

7:00pm–7:10pm | Приветствие 7:10 –7:45pm | Борис Галицкий: Дискурсивный анализ текста для организации диалога 7:45pm–8:00pm | Вопросы и ответы с Борисом Галицким и командой инженеров DeepPavlov

В случае, если вы пропустили Community Calls ранее, вы всегда их можете найти в плейлисте.

Мы приглашаем вас присоединиться к нам, чтобы сообщить, что вы думаете о последних изменениях, поделиться своими ожиданиями от предстоящей версии библиотеки и рассказать, как DeepPavlov помогает вам в ваших проектах!

Оставьте отзыв о библиотеке DeepPavlov

Мы хотим услышать вас. Вы можете заполнить форму ниже, чтобы сообщить нам, как вы используете DeepPavlov Library, что вы хотите, чтобы мы добавили или улучшили! http://bit.ly/DPLibrary2021Survey

Заинтересовались? Не упускайте шанс и присоединяйтесь к нам! Этот Call открыт для всех энтузиастов в области Conversational AI.
discussion

opened by PolinaMrdv 0

Releases(1.0.1)

1.0.1(Nov 22, 2022)
Major Features and Improvements

Added -i/--install CLI argument and install argument to deeppavlov.build_model, deeppavlov.evaluate_model, deeppavlov.train_model to install model requirements before interaction with model (#1603).

Bug Fixes and Other Changes

Reduced library verbosity: redundant logging info messages replaced with debug ones. Set nltk.download to quiet mode. (#1601).

Replaced docs/features/models/classifiers.rst with docs/features/models/classification.ipynb. Fixed minor typos in documentation, removed skill concept (#1600).

Removed /examples from the README.md links (#1602).

Source code(tar.gz)
Source code(zip)
1.0.0(Nov 8, 2022)
Breaking Changes

Changed riseapi mode response format (#1585).

Removed support for TensorFlow v1.x: removed all TF-based components, removed TF mentions from documentation, default train class replaced with torch_trainer (#1574).

TensorFlow-based models were replaced with the PyTorch-based ones, some models were renamed, various models and components were removed.

Replaced Models:

Entity Linking (#1516)

Context Question Answering (#1539)

NER (#1545)

Classifier models (#1565)

Removed Models

Classifiers, Doc Retrieval, Go-Bot, Neural Morphological Tagging, NER, ODQA, Ranking, Spelling Correction, Context Question Answering (#1523)

ASR, TTS (#1526)

ELMO (#1533)

Chinese Context Question Answering (#1534)

Ranking Models (#1537)

Go-Bot (#1544)

KBQA, Multitask BERT (#1560)

Intent Catcher (#1564)

Morpho/Syntax Models (#1573)

Removed Components

Connectors to: Telegram, Microsoft Bot Framework, Amazon Alexa, Yandex Alice (#1548)

Various components (#1563)

Removed serialization mechanism

Major Features and Improvements

Python 3.8/3.9 support (#1525).

Added nested configs overwriting mechanism (#1561).

Added case-agnostic distil NER for DREAM (#1570).

Added DeepPavlov Topics Classifier model (#1584).

Added Russian SuperGLUE models (#1577).

Added external metrics support (#1546).

Added Jupyter Notebook support to documentation (#1592).

Bug Fixes and Other Changes

Requirements updated (#1578).

Models deprecation mechanism (#1547).

Uploaded DeepPavlov BERT models with MLM & NSP heads parameters (#1502).

Fixed en_core_web_sm loading error (#1524)

Fixed NER models table view (#1529)

Removed special version of Transformers library for certain components (#1532)

Fixed tests (#1543)

Updated library output during model training (#1572)

Fixed ConnectionResetError handling in simple_download (#1586)

Minor fixes in KBQA models (#1591)

Added iterations count and speed output during training (#1593)

Fixed datasets version (#1596)

Source code(tar.gz)
Source code(zip)
0.17.6(Sep 16, 2022)
Changed links to group https://github.com/deeppavlov

Source code(tar.gz)
Source code(zip)
0.17.5(Sep 16, 2022)
fixed broken links from https://github.com/deepmipt to https://github.com/deeppavlovteam

Source code(tar.gz)
Source code(zip)
1.0.0rc1(Jul 17, 2022)
Note: DeepPavlov 1.0.0 is not released yet!

DeepPavlov 1.0.0 Release Notes

Added Python 3.8 and 3.9 support and library requirements are optimized in #1525 and #1578.

Removed all TensorFlow components and default trainer replaced with torch_trainer in #1574.

Added Russian SuperGLUE models and submission generation in #1577.

Added NER case-agnostig config in #1570.

Added external metrics support in #1546.

Nested config overwriting mechanism in #1561.

Refactoring of the training logging in #1572.

KBQA models migrated to PyTorch in #1569.

Classification models migrated to PyTorch in #1565.

NER models migrated to PyTorch in #1545.

Context question answering models migrated to PyTorch in #1539.

Entity Linking migrated to PyTorch and reduced RAM and VRAM consumption in #1516.

Added config deprecation mechanism in #1547.

torch_bert_ranker now uses the same Hugging Face Transformers version as the rest of the components in #1532.

Models and components removed in #1523, #1526, #1534, #1533, #1537, #1544, #1560, #1563, #1564, #1573.

Fixed a problem with pre-trained BERT models by DeepPavlov in #1502. Resolves #1275 and #1390.

Fixed en_core_web_sm load error during tests in #1524.

Removed Telegram, MSBot Framework, Yandex Alice and Amazon Alexa connectors in #1548.

Documentation updated in #1517, #1529.

Source code(tar.gz)
Source code(zip)
0.17.4(May 31, 2022)
fix TypeError: Descriptors cannot not be created directly. for Python 3.7.

Source code(tar.gz)
Source code(zip)
0.17.3(Apr 27, 2022)
fix AttributeError module 'lib' has no attribute 'X509_V_FLAG_CB_ISSUER_CHECK.

Source code(tar.gz)
Source code(zip)
1.0.0rc0(Mar 29, 2022)
Renamed models:

When a.json is renamed to b.json, original b.json is removed.

squad_ru_torch_bert -> squad_ru_bert

ner_rus_bert_torch -> ner_rus_bert

insults_kaggle_bert_torch -> insults_kaggle_bert

TensorFlow replaced by PyTorch

squad_ru_bert_infer

Removed models:

asr

asr.json

tts.json

asr_tts.json

elmo

rusentiment_elmo_twitter_cnn

elmo_en_1billion

elmo_ru_news

elmo_ru_twitter

elmo_ru_wiki

classifiers

insults_kaggle

insults_kaggle_conv_bert

intents_dstc2

intents_dstc2_bert

intents_dstc2_big

intents_sample_csv

intents_sample_json

intents_snips

intents_snips_big

intents_snips_sklearn

intents_snips_tfidf_weighted

relation_prediction_rus

ru_obscenity_classifier

rusentiment_bigru_superconv

rusentiment_cnn

sentiment_imdb_bert

sentiment_imdb_conv_bert

sentiment_sst_multi_bert

sentiment_twitter_bert_emb

sentiment_twitter_preproc

sentiment_yelp_conv_bert

sentiment_yelp_multi_bert

sst_torch_swcnn

topic_ag_news

yahoo_convers_vs_info

yahoo_convers_vs_info_bert

doc_retrieval

en_ranker_tfidf_enwiki20161221

go_bot

database_dstc2

gobot_dstc2

gobot_dstc2_best

gobot_dstc2_best_json_nlg

gobot_simple_dstc2

morpho_tagger

morpho_ar

morpho_cs

morpho_de

morpho_en

morpho_es_ancora

morpho_fr

morpho_hi

morpho_hu

morpho_it

morpho_ru_syntagrus

morpho_ru_syntagrus_pymorphy

morpho_ru_syntagrus_pymorphy_lemmatize

morpho_tr

ner

ner_conll2003_pos

ner_dstc2

ner_few_shot_ru

ner_few_shot_ru_simulate

ner_kb_rus

ner_lcquad_bert_probas

ner_ontonotes_m1

slotfill_dstc2

slotfill_dstc2_raw

slotfill_simple_dstc2_raw

slotfill_simple_rasa_raw

vlsp2016_full

odqa

en_odqa_infer_enwiki20161221

ranking

paraphrase_ident_paraphraser

paraphrase_ident_paraphraser_interact

ranking_ubuntu_v2_mt

ranking_ubuntu_v2_mt_interact

spelling_correction

brillmoore_kartaslov_ru

brillmoore_kartaslov_ru_custom_vocab

brillmoore_kartaslov_ru_nolm

squad

squad_bert_uncased

squad_zh_bert_mult

squad_zh_bert_zh

Removed components:

ner_few_shot_iterator

ru_obscenity_classifier

snips_intents_iterator

snips_ner_iterator

snips_reader

mpm_nn

bilstm_gru_nn

morpho_tagger

siamese_predictor

elmo_embedder

ner_bio_converter

ner_svm

jieba_tokenizer

base64_decode_bytesIO

bytesIO_encode_base64

nemo_asr

nemo_tts

Other features:

Python 3.8 and 3.9 support for non-tensorflow-based models

Upload DeepPavlov BERT models with MLM & NSP heads parameters

removed examples directory

minor fixes and improvements

Source code(tar.gz)
Source code(zip)
0.17.2(Dec 16, 2021)
Removed 12 configuration files and 5 components (1498, 1499)

SuperGLUE models updated

Source code(tar.gz)
Source code(zip)
0.17.1(Sep 28, 2021)
Pipeline building syntax using python is simplified

Source code(tar.gz)
Source code(zip)
0.17.0(Sep 7, 2021)
A relation extraction model

A ReCoRD model is based on Reading Comprehension with Commonsense Reasoning Dataset

Source code(tar.gz)
Source code(zip)
0.16.0(Aug 2, 2021)
A distilled Russian BERT

New Pytorch-based models for classification, question answering and named entity recognition

Removed some configuration files and components

Small fixes and updates

Source code(tar.gz)
Source code(zip)
0.15.0(May 14, 2021)
bert_as_summarizer, seq2seq_go_bot, hyperparameter optimization by neural evolution and all deeppavlov.deprecated components were removed

NER automodel

Сomponent-based config requirements generation

Minor edits and fixes

Source code(tar.gz)
Source code(zip)
0.14.1(Apr 3, 2021)
Fixed installation on python3.6

Source code(tar.gz)
Source code(zip)
0.14.0(Dec 24, 2020)
Intent Catcher component

Support of the Hugginface Transformers for classification

Go-Bot formfilling (tutorial)

Entity Linking, Wiki Parser and KBQA as separate components

Minor edits and fixes

Source code(tar.gz)
Source code(zip)
0.13.0(Nov 13, 2020)
Hugging Face datasets support

Go-Bot now requires only RASA-based configs to train its components (intent catcher, slot filler, dialogue state tracker also known as Go-Bot itself)

Prometheus metrics middleware for REST web service

KBQA models fixes

Minor edits of the documentation

Source code(tar.gz)
Source code(zip)
0.12.1(Sep 9, 2020)
Entity Linking for Wikidata (English)

Bugfixes in KBQA model

Other minor improvements

Source code(tar.gz)
Source code(zip)
0.12.0(Aug 11, 2020)
PyTorch Support (w/ Examples)

Multi-task BERT

Basic RASA Configs Support for Go-Bot

Entity Linking

top_n Answers API Update in ODQA Models

Hybrid NER Models Trained on OntoNotes: Ru | En

BoolQ Dataset Reader

Source code(tar.gz)
Source code(zip)
0.11.0(Jun 30, 2020)
Dataset generation tool with a tutorial for Goal-Oriented Dialog Bot

Undocumented feature for downloading artifacts from Amazon s3

New KBQA pipeline for online version of Wikidata

First version of a KBQA pipeline using Syntax tree for generating SPARQL-queries

And many small improvements and fixes

Source code(tar.gz)
Source code(zip)
0.10.0(May 27, 2020)
New Knowledge Base Question Answering model for WikiData

Training interfaces for KBQA models

Refactored goal-oriented bot architecture

And many small improvements and fixes

Source code(tar.gz)
Source code(zip)
0.9.1(Apr 27, 2020)
Fix requirements for ASR and TTS configs

Source code(tar.gz)
Source code(zip)
0.9.0(Apr 20, 2020)
A dataset reader for the IMDB Large Movie Review Dataset and sample configurations for training classifiers on it

Models for speech recognition and synthesis based on NVIDIA NeMo

A New hybrid NER model pre-trained for English and Vietnamese languages

A pre-trained NER-based Model for Sentence Boundary Detection Task

Many smaller changes and fixes

Source code(tar.gz)
Source code(zip)
0.8.0(Feb 26, 2020)
Configuration and model for DRCD (Chinese SQUAD) based on Chinese BERT

Remove all Keras usage in favor of tensorflow.keras

Start refactoring of goal-oriented bot code

Update dependencies versions

Add a BERT-embedding component as a first step of moving from google-research/bert to HuggingFace's Transformers

Release BERT-based sentence embedders models

Smaller changes and fixes

Source code(tar.gz)
Source code(zip)
0.7.1(Nov 29, 2019)
BERT for Extractive Summarization

Source code(tar.gz)
Source code(zip)
0.7.0(Nov 28, 2019)
Replace Flask with FastAPI everywhere

A syntax parser model

Deprecate Agent and Skill classes in favor of DeepPavlov Agent

Change DeepPavlov settings structure

Code style fixes

Many smaller changes and fixes

Source code(tar.gz)
Source code(zip)
0.6.1(Sep 26, 2019)
New gobot tutorial

Replace Flask with FastAPI for riseapi

New section in quickstart documentation about out-of-the-box pretrained models

Minor NER dataset reader fix

Other minor documentation changes

Source code(tar.gz)
Source code(zip)
0.6.0(Sep 6, 2019)
New risesocket mode

A wrapper for Rasa skills

New gobot tutorial

Conversational Bert in Russian

Minor fixes and documentation updates

Source code(tar.gz)
Source code(zip)
0.5.1(Aug 13, 2019)
Downloads resuming after interruptions

Small documentation tweaks

Multilingual SQUAD configuration for inference

Source code(tar.gz)
Source code(zip)
0.5.0(Jul 29, 2019)
Conversational BERT model in English (trained on informal lexicon data)

Updated requirements

Python 3.7 support

New DSL for building rule-based skills

New SOTA insult-detection model based on the conversational BERT

Updated documentation

Refactored MorphoTagger classes to better conform DeepPavlov ideology

Many smaller changes and fixes

Source code(tar.gz)
Source code(zip)
0.4.0(Jun 27, 2019)
New rule-based obscenity classifier for Russian language

Lemmatization option in Morphotagger pipeline for Russian language

Unification of default configuration variables as per #844

Fixes in knowledge base dataset readers

other minor fixes

Source code(tar.gz)
Source code(zip)

An open source library for deep learning end-to-end dialog systems and chatbots.

Related tags

Overview

Quick Links

Installation

QuickStart

GPU requirements

Command line interface (CLI)

Python

Breaking Changes

License

The Team

Comments

Releases(1.0.1)

1.0.1(Nov 22, 2022)

Major Features and Improvements

Bug Fixes and Other Changes

1.0.0(Nov 8, 2022)

Breaking Changes

Major Features and Improvements

Bug Fixes and Other Changes

0.17.6(Sep 16, 2022)

0.17.5(Sep 16, 2022)

1.0.0rc1(Jul 17, 2022)

DeepPavlov 1.0.0 Release Notes

0.17.4(May 31, 2022)

0.17.3(Apr 27, 2022)

1.0.0rc0(Mar 29, 2022)

Renamed models:

TensorFlow replaced by PyTorch

Removed models:

Removed components:

Other features:

0.17.2(Dec 16, 2021)

0.17.1(Sep 28, 2021)

0.17.0(Sep 7, 2021)

0.16.0(Aug 2, 2021)

0.15.0(May 14, 2021)

0.14.1(Apr 3, 2021)

0.14.0(Dec 24, 2020)

0.13.0(Nov 13, 2020)

0.12.1(Sep 9, 2020)

0.12.0(Aug 11, 2020)

0.11.0(Jun 30, 2020)

0.10.0(May 27, 2020)

0.9.1(Apr 27, 2020)

0.9.0(Apr 20, 2020)

0.8.0(Feb 26, 2020)

0.7.1(Nov 29, 2019)

0.7.0(Nov 28, 2019)

0.6.1(Sep 26, 2019)

0.6.0(Sep 6, 2019)

0.5.1(Aug 13, 2019)

0.5.0(Jul 29, 2019)

0.4.0(Jun 27, 2019)

Owner

Neural Networks and Deep Learning lab, MIPT

Simple Speech to Text, Text to Speech

Codes for processing meeting summarization datasets AMI and ICSI.

基于Transformer的单模型、多尺度的VAE模型

Some embedding layer implementation using ivy library

A natural language modeling framework based on PyTorch

GPT-2 Model for Leetcode Questions in python

VoiceFixer VoiceFixer is a framework for general speech restoration.

AIDynamicTextReader - A simple dynamic text reader based on Artificial intelligence

NLP codes implemented with Pytorch (w/o library such as huggingface)

SimCTG - A Contrastive Framework for Neural Text Generation

Yet another Python binding for fastText

Easily train your own text-generating neural network of any size and complexity on any text dataset with a few lines of code.

Easy to start. Use deep nerual network to predict the sentiment of movie review.

Two-stage text summarization with BERT and BART

Poetry PEP 517 Build Backend & Core Utilities

Guide: Finetune GPT2-XL (1.5 Billion Parameters) and GPT-NEO (2.7 B) on a single 16 GB VRAM V100 Google Cloud instance with Huggingface Transformers using DeepSpeed

Chatbot for the Chatango messaging platform

뉴스 도메인 질의응답 시스템 (21-1학기 졸업 프로젝트)

Generating Korean Slogans with phonetic and structural repetition

Shellcode antivirus evasion framework