| donnemartin/data-science-ipython-notebooks |
25,668 |
|
0 |
0 |
over 2 years ago |
0 |
|
34 |
other |
Python |
| Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines. |
| ibis-project/ibis |
3,404 |
|
24 |
29 |
about 2 years ago |
68 |
December 10, 2023 |
157 |
apache-2.0 |
Python |
| The flexibility of Python with the scale and performance of modern SQL. |
| databricks/koalas |
3,291 |
|
1 |
16 |
over 2 years ago |
47 |
October 19, 2021 |
112 |
apache-2.0 |
Python |
| Koalas: pandas API on Apache Spark |
| fugue-project/fugue |
1,821 |
|
0 |
23 |
about 2 years ago |
125 |
November 09, 2023 |
34 |
apache-2.0 |
Python |
| A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites. |
| delta-io/delta-sharing |
654 |
|
0 |
7 |
over 2 years ago |
33 |
December 02, 2023 |
74 |
apache-2.0 |
Scala |
| An open protocol for secure data sharing |
| jaystone776/python-data-science-cheatsheet |
590 |
|
0 |
0 |
over 7 years ago |
0 |
|
2 |
|
|
| Python数据科学速查表 |
| lyhue1991/eat_pyspark_in_10_days |
534 |
|
0 |
0 |
over 3 years ago |
0 |
|
1 |
|
Python |
| pyspark🍒🥭 is delicious,just eat it!😋😋 |
| polyaxon/traceml |
488 |
|
45 |
12 |
about 2 years ago |
10 |
November 25, 2021 |
6 |
apache-2.0 |
Python |
| Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon. |
| ing-bank/popmon |
461 |
|
0 |
2 |
over 2 years ago |
36 |
July 18, 2023 |
15 |
mit |
Python |
| Monitor the stability of a Pandas or Spark dataframe ⚙︎ |
| SuperCowPowers/zat |
409 |
|
0 |
1 |
about 2 years ago |
11 |
January 26, 2023 |
10 |
mit |
Jupyter Notebook |
| Zeek Analysis Tools (ZAT): Processing and analysis of Zeek network data with Pandas, scikit-learn, Kafka and Spark |