Kochat

Are chatbot builders not enough for you, and do you want to build your own deep learning chatbot application?
With Kochat, you can easily build your own deep learning chatbot application.

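# Imports (module paths are assumed from the Kochat package layout and its demo application)
from flask import render_template
from kochat.app import KochatApi
from kochat.data import Dataset
from kochat.loss import CRFLoss, CenterLoss
from kochat.model import embed, entity, intent
from kochat.proc import DistanceClassifier, EntityRecognizer, GensimEmbedder
from demo.scenario import dust, restaurant, travel, weather  # assumed location of the scenario definitions

# 1. Create the dataset object
dataset = Dataset(ood=True)

# 2. Create the embedding processor
emb = GensimEmbedder(model=embed.FastText())

# 3. Create the intent classifier
clf = DistanceClassifier(
    model=intent.CNN(dataset.intent_dict),
    loss=CenterLoss(dataset.intent_dict)
)

# 4. Create the named entity recognizer
rcn = EntityRecognizer(
    model=entity.LSTM(dataset.entity_dict),
    loss=CRFLoss(dataset.entity_dict)
)

# 5. Train and build the deep learning chatbot RESTful API
# Each processor is passed as a (processor, flag) tuple; the flag is assumed to toggle training.
kochat = KochatApi(
    dataset=dataset,
    embed_processor=(emb, True),
    intent_classifier=(clf, True),
    entity_recognizer=(rcn, True),
    scenarios=[
        weather, dust, travel, restaurant
    ]
)

# 6. Connect the view source file
@kochat.app.route('/')
def index():
    return render_template("index.html")

# 7. Start the chatbot application server
if __name__ == '__main__':
    kochat.app.template_folder = kochat.root_dir + 'templates'
    kochat.app.static_folder = kochat.root_dir + 'static'
    kochat.app.run(port=8080, host='0.0.0.0')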

Why Kochat?

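It is the first open-source deep learning chatbot framework that supports Korean. (It is not a chatbot builder.)
It supports a variety of pre-built models and loss functions, so you can build a chatbot without deep knowledge of NLP.
You can plug in your own custom models and loss functions, which makes it even more useful for NLP experts.
It provides every part a chatbot needs: data preprocessing, models, the training pipeline, and a RESTful API.
You do not need to worry about pricing, and it will continue to be offered as an open-source project.
It provides a variety of performance evaluation metrics and powerful visualization features.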
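
As a rough illustration of the kind of evaluation metric mentioned above, the sketch below computes per-intent precision, recall, and F1 in plain Python. It is not Kochat's own implementation: the function name `intent_report` and the sample labels are hypothetical, chosen only to match the scenario names used in the example code.

```python
from collections import Counter


def intent_report(y_true, y_pred):
    """Return {label: (precision, recall, f1)} for every label seen in either list."""
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1
        else:
            fp[pred] += 1   # the predicted label gets a false positive
            fn[truth] += 1  # the true label gets a false negative

    report = {}
    for label in sorted(set(y_true) | set(y_pred)):
        p_den = tp[label] + fp[label]
        r_den = tp[label] + fn[label]
        precision = tp[label] / p_den if p_den else 0.0
        recall = tp[label] / r_den if r_den else 0.0
        f_den = precision + recall
        f1 = 2 * precision * recall / f_den if f_den else 0.0
        report[label] = (precision, recall, f1)
    return report


# Hypothetical intent labels (not real Kochat output)
y_true = ["weather", "weather", "dust", "travel", "restaurant", "dust"]
y_pred = ["weather", "dust", "dust", "travel", "restaurant", "dust"]

for label, (p, r, f1) in intent_report(y_true, y_pred).items():
    print(f"{label:<10} precision={p:.2f} recall={r:.2f} f1={f1:.2f}")
```

Per-label scores like these make it easy to see which intents get confused with each other, which a single accuracy number would hide.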

Documentation

Reference

License

Copyright 2020 Hyunwoong Ko.

Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.
