Textextract Alternatives

textextract is a tiny library (87 lines of Go) that identifies where the article content is in an HTML page (as opposed to navigation, headers, footers, ads, etc.), extracts it, and returns it as a string. Like Boilerpipe, but written in Go for Go.
Alternatives To emiruz/textextract
Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language | Description
typpo/ad-detector | 189 |  | 0 | 0 | over 9 years ago | 0 |  | 9 | mit | JavaScript | Detects articles with corporate sponsors.
edlea/DesktopAMP | 113 |  | 0 | 0 | almost 9 years ago | 0 |  | 7 | mit | JavaScript | Safari and Chrome extensions to load the AMP version of webpages if available
MAVProxyUser/SilverPushUnmasked | 73 |  | 0 | 0 | over 10 years ago | 0 |  | 0 |  | HTML | SilverEdge Inc. SilverPush Demo Apps (unmasked)
sahildave/Gazetti_Newspaper_Reader | 41 |  | 0 | 0 | about 8 years ago | 0 |  | 5 | mit | Java | [Deprecated] Instant News. On Your Fingertips
r-xue/ads2bibdesk | 39 |  | 0 | 0 | over 2 years ago | 9 | June 30, 2020 | 8 | gpl-3.0 | Python | ads2bibdesk helps you add astrophysics articles listed on NASA/ADS to your BibDesk database using the new ADS Developer API
jonathansick/ads_bibdesk | 35 |  | 0 | 0 | about 6 years ago | 0 |  | 33 | gpl-3.0 | Python | (Unmaintained) Mac OS X service for frictionless import of NASA ADS and arXiv publications into BibDesk.
ComboStrap/combo | 13 |  | 0 | 0 | over 2 years ago | 0 |  | 4 | gpl-2.0 | PHP | Dokuwiki Combo Plugin. Making Web Publication a Breeze
wangzailfm/XposedRemoveAd | 12 |  | 0 | 0 | over 8 years ago | 0 |  | 0 |  | Kotlin | Xposed module that removes the startup ad from Weibo International (微博国际版, Weico)
deckerweb/automattic-humility | 9 |  | 0 | 0 | about 7 years ago | 0 |  | 0 | gpl-2.0 | PHP | Humble yourself in the GDPR, Automattic! Only ever track people who explicitly opted in. Spare us your advertisements. -- We look at you, Automattic. We are the WordPress people - we have power, we stand up, we raise our voices.
emiruz/textextract | 8 |  | 0 | 0 | over 7 years ago | 0 |  | 0 | mit | Go | textextract is a tiny library (87 lines of Go) that identifies where the article content is in an HTML page (as opposed to navigation, headers, footers, ads, etc.), extracts it, and returns it as a string. Like Boilerpipe, but written in Go for Go.

Data for Downloads, Dependent Repos, Dependent Packages, Total Releases, and Latest Release is powered by Libraries.io.

Copyright 2018-2026 Awesome Open Source.  All rights reserved.