| adbar/trafilatura |
2,447 |
|
0 |
66 |
about 2 years ago |
39 |
November 29, 2023 |
66 |
gpl-3.0 |
Python |
| Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments |
| gsh199449/spider |
907 |
|
0 |
0 |
over 7 years ago |
0 |
|
3 |
gpl-3.0 |
Java |
| A configurable web spider with an easy-to-use web console |
| currentslab/extractnet |
118 |
|
0 |
0 |
over 2 years ago |
9 |
November 06, 2022 |
3 |
mit |
HTML |
| A fork of Dragnet that also extracts author, headline, date, and keywords from context, with built-in metadata extraction, all in one package |
| lisc-tools/lisc |
81 |
|
0 |
0 |
over 2 years ago |
5 |
October 15, 2023 |
1 |
apache-2.0 |
Python |
| Literature Scanner: Automated collection & analyses of the scientific literature. |
| ardauzunoglu/TRScraper |
47 |
|
0 |
0 |
about 5 years ago |
0 |
|
1 |
mit |
Python |
| TRScraper is an application developed for use in natural language processing work; it enables text mining on large platforms that host Turkish-language content. |
| pesoto/Text-Analysis |
32 |
|
0 |
0 |
over 8 years ago |
0 |
|
0 |
|
Jupyter Notebook |
| Explains textual analysis tools in Python, including preprocessing, skip-gram (word2vec), and topic modelling. |
| johnbumgarner/newshound |
25 |
|
0 |
0 |
about 3 years ago |
1 |
October 06, 2021 |
1 |
mit |
|
| This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around the world in over 50 languages. |
| 0x0be/scrapeadvisor |
22 |
|
0 |
0 |
over 3 years ago |
0 |
|
|
|
Python |
| A user-friendly Python-based GUI that provides sentiment analysis of users' reviews of a specific TripAdvisor facility |
| 0x01h/hepsiburada-review-scraper |
20 |
|
0 |
0 |
over 6 years ago |
0 |
|
0 |
gpl-3.0 |
Python |
| Hepsiburada review/comment and rating scraper. Turkish text dataset creator for data science and NLP projects. 📜 |
| akshitvjain/restaurant-finder-featureReviews |
19 |
|
0 |
0 |
almost 6 years ago |
0 |
|
0 |
mit |
Python |
| Build a Flask web application to help users retrieve key restaurant information and feature-based reviews (generated by applying market-basket model – Apriori algorithm and NLP on user reviews). |