Compute Sentence Embeddings Fast!
NLP 相关的一些文档、论文及代码, 包括主题模型(Topic Model)、词向量(Word Embedding)、命名实体识别(Named Entity Recognition)、文本分类(Text Classificatin)、文本生成(Text Generation)、文本相似性(Text Similarity)计算、机器翻译(Machine Translation)等，涉及到各种与nlp相关的算法，基于tensorflow 2.0。
Fast word vectors with little memory usage in Python
Continuous Machine Learning Training and Deployment on AWS SageMaker
A fast, efficient universal vector embedding utility package.
NLP, Text Mining and Machine Learning starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, keyword extraction with TFIDF, Text Classification with Logistic Regression, word count with pyspark, simple text pre
The TensorFlow reference implementation of 'GEMSEC: Graph Embedding with Self Clustering' (ASONAM 2019).
Data repository for pretrained NLP models and NLP corpora.
AraVec is a pre-trained distributed word representation (word embedding) open source project which aims to provide the Arabic NLP research community with free to use and powerful word embedding models.
Scikit-Learn, NLTK, Spacy, Gensim, Textblob and more
An experiment about re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText) with some additional exclusive features and nice API. Written in Python and fully compatible with Scikit-learn.
Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.
ADAM - A Question Answering System. Inspired from IBM Watson
🦆 Contextually-keyed word vectors
word2vec uisng keras inside gensim
Toolkit to obtain and preprocess german corpora, train models using word2vec (gensim) and evaluate them with generated testsets
Tutorial for Sentiment Analysis using Doc2Vec in gensim (or "getting 87% accuracy in sentiment analysis in under 100 lines of code")
Topic modeling with gensim and LDA
Topic Modelling for Humans
Word2Vec #Gensim #Python Word2Vec is a popular word embedding used in a lot of deep learning applications. In this video we use Gensim to train a ...
ArchSim booting ArchLinux and Xfce4 for ARMv7-A. The ARM processor simulator is generated from a high level architecture description. www.gensim.org.
Description What is the closest word to "king"? Is it "Canute" or is it "crowned"? There are many ways to define "similar words" and "similar texts". Depending on ...
Description https://github.com/bhargavvader/personal/tree/master/notebooks/text_analysis_tutorial This tutorial will guide you through the process of analysing ...
Description I used the Doc2Vec framework to analyze user comments on German online news articles and uncovered some interesting relations among the data ...
Filmed at PyData London 2017 www.pydata.org Description There are many ways to find similar words/docs with an open-source Natural Language processing ...
This video explains word2vec concepts and also helps implement it in gensim library of python. Word2vec extracts features from text and assigns vector ...
PyData London 2016 Python has great open source libraries to extract data from its most raw format - the human readable text. We will discuss a family of ...
PyData Seattle 2015 Gensim is fairly popular NLP library available in Python. In addition to having implementations of several popular algorithms, it has a ...
Gource visualization of gensim (https://github.com/piskvorky/gensim). Topic Modelling for Humans.