| apache/spark |
37,661 |
|
2,394 |
939 |
about 2 years ago |
46 |
May 09, 2021 |
186 |
apache-2.0 |
Scala |
| Apache Spark - A unified analytics engine for large-scale data processing |
| andkret/Cookbook |
12,557 |
|
0 |
0 |
over 2 years ago |
0 |
|
111 |
apache-2.0 |
|
| The Data Engineering Cookbook |
| wangzhiwubigdata/God-Of-BigData |
8,483 |
|
0 |
0 |
over 2 years ago |
0 |
|
3 |
|
|
| 专注大数据学习面试,大数据成神之路开启。Flink/Spark/Hadoop/Hbase/Hive... |
| apache/zeppelin |
6,229 |
|
32 |
31 |
about 2 years ago |
2 |
June 21, 2017 |
160 |
apache-2.0 |
Java |
| Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more. |
| apache/iceberg |
5,179 |
|
0 |
0 |
about 2 years ago |
3 |
October 29, 2022 |
1,485 |
apache-2.0 |
Java |
| Apache Iceberg |
| intel-analytics/BigDL |
4,728 |
|
0 |
10 |
about 2 years ago |
16 |
April 19, 2021 |
958 |
apache-2.0 |
Jupyter Notebook |
| Accelerate LLM with low-bit (FP4 / INT4 / FP8 / INT8) optimizations using bigdl-llm |
| JerryLead/SparkInternals |
4,665 |
|
0 |
0 |
over 4 years ago |
0 |
|
27 |
|
|
| Notes talking about the design and implementation of Apache Spark |
| yahoo/TensorFlowOnSpark |
3,851 |
|
5 |
0 |
almost 3 years ago |
32 |
April 21, 2022 |
13 |
apache-2.0 |
Python |
| TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters. |
| JohnSnowLabs/spark-nlp |
3,578 |
|
0 |
30 |
about 2 years ago |
134 |
December 08, 2023 |
43 |
apache-2.0 |
Scala |
| State of the Art Natural Language Processing |
| RoaringBitmap/RoaringBitmap |
3,308 |
|
435 |
124 |
about 2 years ago |
187 |
September 22, 2023 |
89 |
apache-2.0 |
Java |
| A better compressed bitset in Java: used by Apache Spark, Netflix Atlas, Tablesaw, and many others |