A python framework to transform natural language questions to queries in a database query language.

Last update: Dec 18, 2022

Related tags

Overview

  __ _ _   _  ___ _ __  _   _
 / _` | | | |/ _ \ '_ \| | | |
| (_| | |_| |  __/ |_) | |_| |
 \__, |\__,_|\___| .__/ \__, |
    |_|          |_|    |___/

What's quepy?

Quepy is a python framework to transform natural language questions to queries in a database query language. It can be easily customized to different kinds of questions in natural language and database queries. So, with little coding you can build your own system for natural language access to your database.

Currently Quepy provides support for Sparql and MQL query languages. We plan to extended it to other database query languages.

An example

To illustrate what can you do with quepy, we included an example application to access DBpedia contents via their sparql endpoint.

You can try the example online here: Online demo

Or, you can try the example yourself by doing:

python examples/dbpedia/main.py "Who is Tom Cruise?"

And it will output something like this:

SELECT DISTINCT ?x1 WHERE {
    ?x0 rdf:type foaf:Person.
    ?x0 rdfs:label "Tom Cruise"@en.
    ?x0 rdfs:comment ?x1.
}

Thomas Cruise Mapother IV, widely known as Tom Cruise, is an...

The transformation from natural language to sparql is done by first using a special form of regular expressions:

person_name = Group(Plus(Pos("NNP")), "person_name")
regex = Lemma("who") + Lemma("be") + person_name + Question(Pos("."))

And then using and a convenient way to express semantic relations:

person = IsPerson() + HasKeyword(person_name)
definition = DefinitionOf(person)

The rest of the transformation is handled automatically by the framework to finally produce this sparql:

SELECT DISTINCT ?x1 WHERE {
    ?x0 rdf:type foaf:Person.
    ?x0 rdfs:label "Tom Cruise"@en.
    ?x0 rdfs:comment ?x1.
}

Using a very similar procedure you could generate and MQL query for the same question obtaining:

[{
    "/common/topic/description": [{}],
    "/type/object/name": "Tom Cruise",
    "/type/object/type": "/people/person"
}]

Installation

You need to have installed docopt and numpy. Other than that, you can just type:

pip install quepy

You can get more details on the installation here:

http://quepy.readthedocs.org/en/latest/installation.html

Learn more

You can find a tutorial here:

http://quepy.readthedocs.org/en/latest/tutorial.html

And the full documentation here:

http://quepy.readthedocs.org/

Join our mailing list

Contribute!

Want to help develop quepy? Welcome aboard! Find us in http://groups.google.com/group/quepy

A python framework to transform natural language questions to queries in a database query language.

Related tags

Overview

What's quepy?

An example

Installation

Learn more

Contribute!

Owner

Machinalis

BMInf (Big Model Inference) is a low-resource inference package for large-scale pretrained language models (PLMs).

Pipeline for fast building text classification TF-IDF + LogReg baselines.

Sentiment Analysis Project using Count Vectorizer and TF-IDF Vectorizer

Code for our ACL 2021 paper - ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer

A framework for implementing federated learning

Telegram AI chat bot written in Python using Pyrogram

小布助手对话短文本语义匹配的一个baseline

Biterm Topic Model (BTM): modeling topics in short texts

A method to generate speech across multiple speakers

NLP Text Classification

This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"

Bpe algorithm can finetune tokenizer - Bpe algorithm can finetune tokenizer

Words-per-minute - A terminal app written in python utilizing the curses module that tests the user's ability to type

Training RNNs as Fast as CNNs

Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.

Extract Keywords from sentence or Replace keywords in sentences.

A crowdsourced dataset of dialogues grounded in social contexts involving utilization of commonsense.

Chinese NewsTitle Generation Project by GPT2.带有超级详细注释的中文GPT2新闻标题生成项目。

OceanScript is an Esoteric language used to encode and decode text into a formulation of characters

Programme de chiffrement et de déchiffrement inverse d'un message en python3.