| ibis-project/ibis |
3,404 |
|
24 |
29 |
about 2 years ago |
68 |
December 10, 2023 |
157 |
apache-2.0 |
Python |
| The flexibility of Python with the scale and performance of modern SQL. |
| lyhue1991/eat_pyspark_in_10_days |
534 |
|
0 |
0 |
over 3 years ago |
0 |
|
1 |
|
Python |
| pyspark🍒🥭 is delicious,just eat it!😋😋 |
| firmai/pandapy |
483 |
|
0 |
0 |
over 4 years ago |
22 |
January 25, 2020 |
2 |
|
Python |
| PandaPy has the speed of NumPy and the usability of Pandas 10x to 50x faster (by @firmai) |
| capitalone/datacompy |
339 |
|
0 |
10 |
about 2 years ago |
20 |
November 15, 2023 |
16 |
apache-2.0 |
Python |
| Pandas and Spark DataFrame comparison for humans and more! |
| sparklingpandas/sparklingpandas |
338 |
|
1 |
0 |
over 8 years ago |
7 |
August 08, 2015 |
51 |
apache-2.0 |
Python |
| Sparkling Pandas |
| ThreatHuntingProject/hunter |
170 |
|
0 |
0 |
over 4 years ago |
0 |
|
0 |
mit |
Jupyter Notebook |
| A threat hunting / data analysis environment based on Python, Pandas, PySpark and Jupyter Notebook. |
| dvgodoy/handyspark |
129 |
|
0 |
0 |
almost 7 years ago |
7 |
May 19, 2019 |
8 |
mit |
Jupyter Notebook |
| HandySpark - bringing pandas-like capabilities to Spark dataframes |
| autodeployai/pypmml |
64 |
|
0 |
6 |
over 3 years ago |
15 |
November 03, 2022 |
4 |
apache-2.0 |
Python |
| Python PMML scoring library |
| canimus/cuallee |
56 |
|
0 |
1 |
about 2 years ago |
54 |
October 28, 2023 |
2 |
apache-2.0 |
Python |
| A data quality acceleration library to get data sets verified in a friendly interface |
| shauryashaurya/learn-data-munging |
37 |
|
0 |
0 |
about 2 years ago |
0 |
|
0 |
mit |
Jupyter Notebook |
| Notes on Data Engineering with Pandas, PySpark, Dask, Ray, Arrow DataFusion, Polars etc. |