Python library for parsing resumes using natural language processing and machine learning

Last update: Jul 29, 2021

Overview

CVParser

Python library for parsing resumes using natural language processing and machine learning.

Setup

Installation on Linux and Mac OS

Follow the guide here on how to clone or fork a repo
Follow the guide here on how to create virtualenv

Create a virtualenv (for example, myvenv), activate it, and install the requirements:

$ virtualenv --python=python3 myvenv

$ source myvenv/bin/activate

(myvenv) $ pip install -r requirements.txt

Usage

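from cvparser.parser import CVParser

# Download the language data the parser depends on (only needed once).
CVParser.download_nlk_data()

# Point the parser at a resume in one of the supported formats.
parser = CVParser(file_path="path/to/file.[pdf|doc|docx|png|jpeg]")
parser.parse()

# Print the parsed resume as JSON.
print(parser.json())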
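
When a whole folder of resumes needs parsing, it may help to first collect only the files the parser is likely to accept. The following is a minimal sketch, not part of cvparser: the helper name find_resumes is hypothetical, and it assumes the extensions shown in the file_path example above are the complete list of supported formats.

```python
from pathlib import Path

# Extensions taken from the file_path example above; treating this list as
# exhaustive is an assumption.
SUPPORTED_EXTENSIONS = {".pdf", ".doc", ".docx", ".png", ".jpeg"}


def find_resumes(folder):
    """Return the files in `folder` whose extension looks parseable, sorted by path."""
    return sorted(
        path
        for path in Path(folder).iterdir()
        # Skip directories and compare extensions case-insensitively.
        if path.is_file() and path.suffix.lower() in SUPPORTED_EXTENSIONS
    )
```

Each returned path could then be passed to the parser as CVParser(file_path=str(path)), followed by parse() and json() as shown above.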

Re-training the Model

cd into the train folder.
Delete the model folder and the train.json file.
Copy your new training data into the train folder. The training data must be in JSON format and can be generated with the data annotation tool Dataturks. The file containing the training data must be named train.json.
Start re-training the model by running the Python script manual_training.py in the train folder.
Test your new model by following the steps in the Usage section.
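
The folder preparation steps above can be sketched in Python. This is an illustration under stated assumptions rather than a script shipped with the project: the function names are hypothetical, and the check accepts either a single JSON document or one JSON object per line, on the assumption that the annotation export may use either layout.

```python
import json
import shutil
from pathlib import Path


def load_training_records(path):
    """Parse training data as one JSON document or as one JSON object per line."""
    text = Path(path).read_text(encoding="utf-8")
    try:
        data = json.loads(text)
        return data if isinstance(data, list) else [data]
    except json.JSONDecodeError:
        # Fall back to line-by-line parsing; raises if any line is invalid.
        return [json.loads(line) for line in text.splitlines() if line.strip()]


def prepare_train_folder(train_dir, new_data):
    """Remove the old model folder and install new_data as train.json."""
    train_dir = Path(train_dir)
    records = load_training_records(new_data)  # fail early on malformed data
    if not records:
        raise ValueError(f"{new_data} contains no training records")
    shutil.rmtree(train_dir / "model", ignore_errors=True)
    # copyfile overwrites any existing train.json, which replaces the old data.
    shutil.copyfile(new_data, train_dir / "train.json")
    return len(records)
```

After prepare_train_folder succeeds, re-training would still be started by running manual_training.py from the train folder, as described in the steps above.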


Owner

nafiu
