| Repository | Stars | Dependent Packages | Dependent Repos | Last Commit | Total Releases | Latest Release | Open Issues | License | Language | Description |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| brightmart/nlp_chinese_corpus | 8,344 | 0 | 0 | almost 3 years ago | 0 | | 20 | mit | | Large Scale Chinese Corpus for NLP |
| UCDenver-ccp/CRAFT | 58 | 0 | 0 | over 3 years ago | 0 | | 1 | other | Clojure | |
| dav009/abacus | 42 | 0 | 0 | over 8 years ago | 0 | May 24, 2021 | 0 | | Go | Counter data structure for Go using a Count-Min Sketch with a fixed amount of memory |
| lzhenboy/word2vec-Chinese | 34 | 0 | 0 | over 6 years ago | 0 | | 1 | | Python | A tutorial for training Chinese word2vec on a Wikipedia corpus |
| insikk/namu_wiki_db_preprocess | 22 | 0 | 0 | almost 9 years ago | 0 | | 0 | apache-2.0 | Jupyter Notebook | A Python script that converts the namu wiki database into a large Korean-language corpus |
| JiaLiangShen/Chinese-Article-Classification-based-on-own-corpus-via-TextCNN-and-GBDT | 16 | 0 | 0 | almost 8 years ago | 0 | | 1 | | Python | Chinese text classification, including basic corpus processing, handling of Wiki_zh, etc. |
| mmcctt00/SpanishTransformerXL | 12 | 0 | 0 | over 6 years ago | 0 | | 0 | | Jupyter Notebook | Spanish language model trained on a wiki corpus (500M tokens) with fastai v1; accuracy > 42.3%, vocabulary size 60K |
| uma-pi1/OPIEC | 12 | 0 | 0 | almost 7 years ago | 0 | | 0 | gpl-3.0 | Java | Reading the data from OPIEC, an Open Information Extraction corpus |
| CyberZHG/wiki-dump-reader | 10 | 0 | 1 | about 7 years ago | 4 | February 01, 2019 | 2 | mit | Python | Extract corpora from Wikipedia dumps |
| zhouhoo/wiki_zh_vec | 7 | 0 | 0 | over 9 years ago | 0 | | 0 | apache-2.0 | Python | A Python tool for training word embeddings on the Chinese Wikipedia corpus with word2vec, GloVe, and LexVec |
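The abacus entry above is built on a Count-Min Sketch, a probabilistic structure that counts item frequencies in fixed memory at the cost of possible over-counting. As a rough illustration of the technique only (this is a hypothetical pure-Python sketch, not abacus's Go API), it might look like:

```python
import hashlib

class CountMinSketch:
    """Approximate frequency counter with fixed memory.

    Illustrative sketch of the Count-Min Sketch technique;
    estimates may over-count but never under-count.
    """

    def __init__(self, width=1024, depth=4):
        self.width = width
        self.depth = depth
        # depth x width grid of counters: total memory is fixed
        # regardless of how many distinct items are added.
        self.table = [[0] * width for _ in range(depth)]

    def _indexes(self, item):
        # One hash per row, derived by salting a single digest.
        for row in range(self.depth):
            digest = hashlib.md5(f"{row}:{item}".encode()).hexdigest()
            yield row, int(digest, 16) % self.width

    def add(self, item, count=1):
        for row, col in self._indexes(item):
            self.table[row][col] += count

    def estimate(self, item):
        # Collisions only inflate counters, so the minimum across
        # rows is the tightest available estimate.
        return min(self.table[row][col] for row, col in self._indexes(item))
```

A counter like this is why such libraries can tally word frequencies over a large corpus without the memory footprint of an exact dictionary.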
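Several of the repositories above start from a MediaWiki XML dump and reduce it to plain text. A minimal stdlib-only sketch of that idea (the `SAMPLE_DUMP`, `strip_markup`, and `extract_pages` names are hypothetical, and real dumps are namespaced multi-gigabyte files that need streaming parsing and far more markup rules) might look like:

```python
import re
import xml.etree.ElementTree as ET

# Tiny stand-in for a MediaWiki XML dump; real dumps use an XML
# namespace and are parsed incrementally with ET.iterparse.
SAMPLE_DUMP = """<mediawiki>
  <page>
    <title>Corpus</title>
    <revision><text>A '''corpus''' is a [[text]] collection.</text></revision>
  </page>
</mediawiki>"""

def strip_markup(wikitext):
    # [[target|label]] -> label, [[target]] -> target
    text = re.sub(r"\[\[(?:[^|\]]*\|)?([^\]]+)\]\]", r"\1", wikitext)
    # '''bold''' and ''italic'' quotes -> plain text
    text = re.sub(r"'{2,}", "", text)
    return text

def extract_pages(dump_xml):
    """Yield (title, plain_text) for each <page> in the dump."""
    root = ET.fromstring(dump_xml)
    for page in root.iter("page"):
        title = page.findtext("title")
        raw = page.findtext("revision/text") or ""
        yield title, strip_markup(raw)
```

For example, `dict(extract_pages(SAMPLE_DUMP))` maps `"Corpus"` to the cleaned sentence with the link and bold markup removed.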