Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
The Top 10 Data Cleaning Open Source Projects
Open source projects categorized as Data Cleaning
Categories
>
Data Processing
>
Data Cleaning
Edit Category
OpenRefine/OpenRefine
⭐
10,106
OpenRefine is a free, open source power tool for working with messy data and improving it
dependent packages
0
total releases
0
most recent commit
about 2 years ago
great-expectations/great_expectations
⭐
9,179
Always know what to expect from your data.
dependent packages
0
total releases
0
most recent commit
about 2 years ago
johnkerl/miller
⭐
8,397
Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
dependent packages
0
total releases
0
most recent commit
about 2 years ago
cleanlab/cleanlab
⭐
7,747
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
dependent packages
0
total releases
0
most recent commit
about 2 years ago
unionai-oss/pandera
⭐
2,807
A light-weight, flexible, and expressive statistical data testing library
dependent packages
0
total releases
0
most recent commit
about 2 years ago
justmarkham/pandas-videos
⭐
1,808
Jupyter notebook and datasets from the pandas Q&A video series
dependent packages
0
total releases
0
most recent commit
almost 4 years ago
sfu-db/dataprep
⭐
1,807
Open-source low code data preparation library in python. Collect, clean and visualization your data in python with a few lines of code.
dependent packages
0
total releases
0
most recent commit
over 2 years ago
skrub-data/skrub
⭐
1,591
Machine learning with dataframes
dependent packages
0
total releases
0
most recent commit
4 days ago
justmarkham/DAT8
⭐
1,549
General Assembly's 2015 Data Science course in Washington, DC
dependent packages
0
total releases
0
most recent commit
over 3 years ago
hi-primus/optimus
⭐
1,540
:truck: Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
dependent packages
0
total releases
0
most recent commit
over 1 year ago
Get A Weekly Email With Trending Data Cleaning Projects
No Spam. Unsubscribe easily at any time.
Data Cleaning
Subscribe
Javascript must be enabled to subscribe.
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2026 Awesome Open Source. All rights reserved.