Dmoz Urlclassifier Alternatives

Preparing DMOZ dataset for my n-Gram LM-based URL classification research
Suggest Alternative
Alternatives To gr33ndata/dmoz-urlclassifier
Project Name Stars Downloads Repos Using This Packages Using This Most Recent Commit Total Releases Latest Release Open Issues License Language
simonw/datasette 8,621 35 173 about 2 years ago 140 October 08, 2023 556 apache-2.0 Python
An open source multi-tool for exploring and publishing data
lukes/ISO-3166-Countries-with-Regional-Codes 1,971 0 0 over 2 years ago 0 August 26, 2016 12 other Ruby
ISO 3166-1 country lists merged with their UN Geoscheme regional codes in ready-to-use JSON, XML, CSV data sets
yhenon/pytorch-retinanet 1,764 0 0 about 4 years ago 0 134 apache-2.0 Python
Pytorch implementation of RetinaNet object detection.
UniversalDataTool/universal-data-tool 1,612 0 0 almost 4 years ago 0 173 mit JavaScript
Collaborate & label any type of data, images, text, or documents, in an easy web interface or desktop app.
capitalone/DataProfiler 1,310 0 3 about 2 years ago 53 November 14, 2023 56 apache-2.0 Python
What's in your data? Extract schema, statistics and entities from datasets
metmuseum/openaccess 1,041 0 0 almost 3 years ago 0 24 cc0-1.0
The Metropolitan Museum of Art's Open Access Initiative
ashvardanian/StringZilla 999 0 0 about 2 years ago 5 November 19, 2023 16 apache-2.0 C
Up to 10x faster string search, split, sort, and shuffle for long strings and multi-gigabyte files in Python and C, leveraging SIMD with just a few lines of Arm Neon and x86 AVX2 & AVX-512 intrinsics 🦖
spytensor/prepare_detection_dataset 760 0 0 over 4 years ago 0 3 mit Python
convert dataset to coco/voc format
papyrussolution/UhttBarcodeReference 758 0 0 over 2 years ago 0 8
Universe-HTT barcode reference
amaboura/panama-papers-dataset-2016 701 0 0 almost 10 years ago 0 5 gpl-3.0 Jupyter Notebook
Structured data about Panama papers collected from official ICIJ website
Alternatives To gr33ndata/dmoz-urlclassifier
Select To Compare


Alternative Project Comparisons
Popular Dataset Projects
Popular Csv Projects
Popular Data Processing Categories
Related Searches
Get A Weekly Email With Trending Projects
No Spam. Unsubscribe easily at any time.
Privacy | About | Terms | Follow Us On Twitter

Downloads, Dependent Repos, Dependent Packages, Total Releases, Latest Releases data powered by Libraries.io.

Copyright 2018-2026 Awesome Open Source.  All rights reserved.