| adbar/trafilatura | 2,447 | | 0 | 66 | about 2 years ago | 39 | November 29, 2023 | 66 | gpl-3.0 | Python | Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments |
| shangjingbo1226/AutoPhrase | 978 | | 0 | 0 | about 4 years ago | 3 | November 19, 2020 | 6 | apache-2.0 | C++ | AutoPhrase: Automated Phrase Mining from Massive Text Corpora |
| adbar/German-NLP | 360 | | 0 | 0 | over 2 years ago | 0 | | 0 | | | Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German |
| ko-ichi-h/khcoder | 295 | | 0 | 0 | over 2 years ago | 0 | | 10 | gpl-2.0 | Perl | KH Coder: for Quantitative Content Analysis or Text Mining |
| oroszgy/awesome-hungarian-nlp | 192 | | 0 | 0 | over 2 years ago | 0 | | 1 | | | A curated list of NLP resources for Hungarian |
| PolMine/polmineR | 45 | | 1 | 2 | over 2 years ago | 22 | October 29, 2023 | 45 | | HTML | R-package for text mining with the Corpus Workbench (CWB) as backend |
| luozhouyang/AutoPhraseX | 38 | | 0 | 0 | almost 5 years ago | 4 | May 23, 2021 | 0 | apache-2.0 | Python | Automated Phrase Mining from Massive Text Corpora in Python. |
| nicolasassi/gomtch | 26 | | 0 | 0 | over 4 years ago | 2 | August 11, 2021 | 0 | bsd-3-clause | Go | Find text even if it doesn't want to be found |
| yumeng5/JoSH | 20 | | 0 | 0 | about 5 years ago | 0 | | 2 | apache-2.0 | C | [KDD 2020] Hierarchical Topic Mining via Joint Spherical Tree and Text Embedding |
| hrbrmstr/elpresidente | 19 | | 0 | 0 | almost 8 years ago | 0 | | 1 | | R | 🇺🇸 Search and Extract Corpus Elements from 'The American Presidency Project' |