Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
The Top 10 Data Mining Open Source Projects
Open source projects categorized as Data Mining
Categories
>
Data Processing
>
Data Mining
Edit Category
eriklindernoren/ML-From-Scratch
⭐
22,560
Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learning.
dependent packages
0
total releases
0
most recent commit
over 2 years ago
academic/awesome-datascience
⭐
22,459
:memo: An awesome Data Science repository to learn and apply for real world problems.
dependent packages
0
total releases
0
most recent commit
over 2 years ago
JaidedAI/EasyOCR
⭐
20,438
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
dependent packages
0
total releases
0
most recent commit
over 2 years ago
microsoft/LightGBM
⭐
15,819
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.
dependent packages
0
total releases
0
most recent commit
about 2 years ago
EthicalML/awesome-production-machine-learning
⭐
15,344
A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning
dependent packages
0
total releases
0
most recent commit
about 2 years ago
piskvorky/gensim
⭐
14,915
Topic Modelling for Humans
dependent packages
0
total releases
0
most recent commit
over 2 years ago
rasbt/python-machine-learning-book
⭐
11,645
The "Python Machine Learning (1st edition)" book code repository and info resource
dependent packages
0
total releases
0
most recent commit
over 3 years ago
OpenRefine/OpenRefine
⭐
10,106
OpenRefine is a free, open source power tool for working with messy data and improving it
dependent packages
0
total releases
0
most recent commit
about 2 years ago
tangyudi/Ai-Learn
⭐
7,757
人工智能学习路线图,整理近200个实战案例与项目,免费提供配套教材,零基础入门,就业实战!包括:Python,数学,机器学习,数据分析,深度学习,计算机视觉,自然语言处理,PyTorch tensorflow machine-learning,deep-learning data-analysis data-mining mathematics data-science artificial-intelligence python tensorflow tensorflow2 caffe keras pytorch algorithm numpy pandas matplotlib seaborn nlp cv等热门领域
dependent packages
0
total releases
0
most recent commit
over 2 years ago
yzhao062/pyod
⭐
7,751
A Comprehensive and Scalable Python Library for Outlier Detection (Anomaly Detection)
dependent packages
0
total releases
0
most recent commit
over 2 years ago
Get A Weekly Email With Trending Data Mining Projects
No Spam. Unsubscribe easily at any time.
Data Mining
Subscribe
Javascript must be enabled to subscribe.
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2026 Awesome Open Source. All rights reserved.