Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
The Top 10 Data Processing Open Source Projects
Open source projects categorized as Data Processing
Categories
>
Data Processing
>
Data Processing
Edit Category
onceupon/Bash-Oneliner
⭐
10,710
A collection of handy Bash One-Liners and terminal tricks for data processing and Linux system maintenance.
dependent packages
0
total releases
0
most recent commit
3 months ago
johnkerl/miller
⭐
8,397
Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
dependent packages
0
total releases
0
most recent commit
about 2 years ago
lorien/awesome-web-scraping
⭐
6,060
List of libraries, tools and APIs for web scraping and data processing.
dependent packages
0
total releases
0
most recent commit
over 2 years ago
NVIDIA/DALI
⭐
4,770
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
dependent packages
0
total releases
0
most recent commit
about 2 years ago
TomWright/dasel
⭐
4,695
Select, put and delete data from JSON, TOML, YAML, XML and CSV files with a single tool. Supports conversion between formats and can be used as a Go package.
dependent packages
0
total releases
0
most recent commit
about 2 years ago
unionai-oss/pandera
⭐
2,807
A light-weight, flexible, and expressive statistical data testing library
dependent packages
0
total releases
0
most recent commit
about 2 years ago
dashbitco/broadway
⭐
2,608
Concurrent and multi-stage data ingestion and data processing with Elixir
dependent packages
0
total releases
0
most recent commit
3 months ago
microsoft/DialoGPT
⭐
2,283
Large-scale pretraining for dialogue
dependent packages
0
total releases
0
most recent commit
over 3 years ago
asyml/texar
⭐
2,008
Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow
dependent packages
0
total releases
0
most recent commit
over 5 years ago
python-bonobo/bonobo
⭐
1,604
Extract Transform Load for Python 3.5+
dependent packages
0
total releases
0
most recent commit
almost 3 years ago
Get A Weekly Email With Trending Data Processing Projects
No Spam. Unsubscribe easily at any time.
Data Processing
Subscribe
Javascript must be enabled to subscribe.
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2026 Awesome Open Source. All rights reserved.