| TileDB-Inc/TileDB |
1,700 |
|
0 |
6 |
about 2 years ago |
87 |
November 05, 2022 |
133 |
mit |
C++ |
| The Universal Storage Engine |
| astronomer/astro-sdk |
303 |
|
0 |
2 |
about 2 years ago |
49 |
August 30, 2023 |
153 |
apache-2.0 |
Python |
| Astro SDK allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow. |
| svenkreiss/pysparkling |
253 |
|
7 |
1 |
over 3 years ago |
69 |
November 13, 2022 |
9 |
other |
Python |
| A pure Python implementation of Apache Spark's RDD and DStream interfaces. |
| RumbleDB/rumble |
194 |
|
0 |
0 |
almost 3 years ago |
4 |
December 03, 2019 |
134 |
other |
Java |
| ⛈️ RumbleDB 1.21.0 "Hawthorn blossom" 🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to download) | Declarative Machine Learning and more |
| jay-johnson/sci-pype |
96 |
|
0 |
0 |
about 6 years ago |
0 |
|
3 |
apache-2.0 |
Python |
| A Machine Learning API with native redis caching and export + import using S3. Analyze entire datasets using an API for building, training, testing, analyzing, extracting, importing, and archiving. This repository can run from a docker container or from the repository. |
| urigoren/decorators4DS |
27 |
|
0 |
0 |
over 3 years ago |
0 |
|
0 |
mit |
Python |
| Useful decorators every Data Scientist should know |
| nodestream-proj/nodestream |
23 |
|
0 |
0 |
about 2 years ago |
0 |
|
25 |
apache-2.0 |
Python |
| A Fast, Declarative, and Extensible ETL Framework for Graph Databases. |
| yaojiach/red-panda |
18 |
|
0 |
0 |
almost 4 years ago |
0 |
|
|
mit |
Python |
| Easily interact with cloud (AWS) in your Data Science workflow. |
| ClusterlessHQ/tessellate |
5 |
|
0 |
0 |
over 2 years ago |
0 |
|
5 |
other |
Java |
| A data engineering cli for reading and writing data to/from multiple locations across multiple formats. |