Python binding for Morfologik

Morfologik is Polish morphological analyzer. For more information see http://github.com/morfologik/morfologik-stemming/ and http://http://www.morfologik.blogspot.com/

Requirements

This binding works with Python 2 and Python 3.

Installation

Install it from pip

pip install pyMorfologik

or directly from github

git clone https://github.com/dmirecki/pyMorfologik.git

Usage

Now, only simple stems are supported:

>>> from pymorfologik import Morfologik
>>> from pymorfologik.parsing import ListParser
>>>
>>> parser = ListParser()
>>> stemmer = Morfologik()
>>> stemmer.stem(['Ala ma kota'], parser)
[(u'Ala',
  {u'Al': [u'subst:sg:acc:m1+subst:sg:gen:m1'],
   u'Ala': [u'subst:sg:nom:f'],
   u'Alo': [u'subst:sg:acc:m1+subst:sg:gen:m1']}),
 (u'ma',
  {u'mieć': [u'verb:fin:sg:ter:imperf:refl.nonrefl'],
   u'mój': [u'adj:sg:nom.voc:f:pos']}),
 (u'kota', {u'kot': [u'subst:sg:acc:m1'], u'kota': [u'subst:sg:nom:f']})]

Acknowledgements

This repo is based on Morfologik, a great contribution of Marcin Miłowski (http://marcinmilkowski.pl) and Dawid Weiss (http://www.dawidweiss.com).

Contributions

Damian Mirecki

Adrian Bohdanowicz

pyMorfologik MorfologikpyMorfologik - Python binding for Morfologik.

Related tags

Overview

Python binding for Morfologik

Requirements

Installation

Usage

Acknowledgements

Contributions

Owner

Damian Mirecki

RIDE automatically creates the package and boilerplate OOP Python node scripts as per your needs

Code for paper "Role-oriented Network Embedding Based on Adversarial Learning between Higher-order and Local Features"

code for "AttentiveNAS Improving Neural Architecture Search via Attentive Sampling"

The swas programming language

Build Text Rerankers with Deep Language Models

Multilingual finetuning of Machine Translation model on low-resource languages. Project for Deep Natural Language Processing course.

An automated program that helps customers of Pizza Palour place their pizza orders

Prompt-learning is the latest paradigm to adapt pre-trained language models (PLMs) to downstream NLP tasks

中文无监督SimCSE Pytorch实现

This repository contains the code for "Generating Datasets with Pretrained Language Models".

Code for the paper in Findings of EMNLP 2021: "EfficientBERT: Progressively Searching Multilayer Perceptron via Warm-up Knowledge Distillation".

Signature remover is a NLP based solution which removes email signatures from the rest of the text.

Generate custom detailed survey paper with topic clustered sections and proper citations, from just a single query in just under 30 mins !!

Sequence-to-sequence framework with a focus on Neural Machine Translation based on Apache MXNet

A Transformer Implementation that is easy to understand and customizable.

2021 AI CUP Competition on Traditional Chinese Scene Text Recognition - Intermediate Contest

Calibre recipe to convert latest issue of Analyse & Kritik into an ebook

pysentimiento: A Python toolkit for Sentiment Analysis and Social NLP tasks

Ceaser-Cipher - The Caesar Cipher technique is one of the earliest and simplest method of encryption technique

[NeurIPS 2021] Code for Learning Signal-Agnostic Manifolds of Neural Fields