Wiki Split Alternatives

Name: google-research-datasets/wiki-split

One million English sentences, each split into two sentences that together preserve the original meaning, extracted from Wikipedia edits.

Stars: 72
License: No license specified
Open Issues: 2
Most Recent Commit: almost 7 years ago
Dependent Repos: 0
Dependent Packages: 0
Total Releases: 0

Categories

Machine Learning > Deep Learning

Data Processing > Dataset

Learning Resources > Paper

Machine Learning > Natural Language Processing

Companies > Wikipedia

Alternatives To google-research-datasets/wiki-split

Project Name	Stars	Repos Using This	Packages Using This	Most Recent Commit	Total Releases	Open Issues	License	Language
roboreport/doc2vec-api	92	0	0	over 3 years ago	0	1	lgpl-2.1	Python
document embedding and machine learning script for beginners
Hironsan/ja.text8	74	0	0	over 8 years ago	0	0		Python
Japanese text8 corpus for word embedding.
koomri/text-segmentation	73	0	0	over 6 years ago	0	3		Python
Implementation of the paper: Text Segmentation as a Supervised Learning Task
google-research-datasets/wiki-split	72	0	0	almost 7 years ago	0	2
One million English sentences, each split into two sentences that together preserve the original meaning, extracted from Wikipedia edits.
blei-lab/deep-exponential-families	53	0	0	about 8 years ago	0	0		C++
Deep exponential families (DEFs)
google-research-datasets/wiki-atomic-edits	47	0	0	almost 7 years ago	0	1
A dataset of atomic wikipedia edits containing insertions and deletions of a contiguous chunk of text in a sentence. This dataset contains ~43 million edits across 8 languages.
EagleW/Describing_a_Knowledge_Base	42	0	0	almost 5 years ago	0	0	mit	Python
Code for Describing a Knowledge Base
thoppe/today-AI-learned	35	0	0	over 10 years ago	0	0		Python
Training a classifier to reddit's TIL to find new things on Wikipedia
rodrigosetti/dbn-cuda	34	0	0	almost 11 years ago	0	0		Python
GPU accelerated Deep Belief Network
todd-cook/ML-You-Can-Use	24	0	0	about 4 years ago	0	3	other	Jupyter Notebook
Practical ML and NLP with examples.

Popular Wikipedia Projects

kamranahmedse/design-patterns-for-humans ⭐ 42,678

An ultra-simplified explanation to design patterns

dwmkerr/hacker-laws ⭐ 24,993

💻📖 Laws, Theories, Principles and Patterns that developers will find useful. #hackerlaws

wikimedia/mediawiki ⭐ 4,936

🌻 The collaborative editing software that runs Wikipedia. Mirror from https://gerrit.wikimedia.org/g/mediawiki/core. See https://mediawiki.org/wiki/Developer_access for contributing.

sohamkamani/javascript-design-patterns-for-humans ⭐ 4,191

An ultra-simplified explanation of design patterns implemented in javascript

attardi/wikiextractor ⭐ 3,440

A tool for extracting plain text from Wikipedia dumps

Popular Deep Learning Projects

tensorflow/tensorflow ⭐ 180,196

An Open Source Machine Learning Framework for Everyone

huggingface/transformers ⭐ 119,240

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

AUTOMATIC1111/stable-diffusion-webui ⭐ 118,856

Stable Diffusion web UI

pytorch/pytorch ⭐ 74,794

Tensors and Dynamic neural networks in Python with strong GPU acceleration

opencv/opencv ⭐ 73,748

Open Source Computer Vision Library
