| cython/cython |
8,667 |
|
13,709 |
2,472 |
about 2 years ago |
135 |
November 26, 2023 |
1,272 |
apache-2.0 |
Python |
| The most widely used Python to C compiler |
| tensorbase/tensorbase |
1,217 |
|
0 |
0 |
almost 4 years ago |
0 |
|
47 |
apache-2.0 |
Rust |
| TensorBase is a new big data warehousing with modern efforts. |
| GraphChi/graphchi-cpp |
710 |
|
0 |
0 |
over 7 years ago |
0 |
|
32 |
|
C++ |
| GraphChi's C++ version. Big Data - small machine. |
| minio/sidekick |
503 |
|
0 |
0 |
over 2 years ago |
34 |
February 14, 2022 |
1 |
agpl-3.0 |
Go |
| High Performance HTTP Sidecar Load Balancer |
| apache/incubator-wayang |
162 |
|
1 |
18 |
about 2 years ago |
4 |
June 24, 2025 |
84 |
apache-2.0 |
Java |
| Apache Wayang(incubating) is the first cross-platform data processing system. |
| MemVerge/splash |
86 |
|
0 |
0 |
over 6 years ago |
0 |
|
3 |
apache-2.0 |
Scala |
| Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange |
| lsds/StreamBench |
41 |
|
0 |
0 |
almost 7 years ago |
0 |
|
2 |
apache-2.0 |
C++ |
| Measuring the performance of popular streaming engines with Yahoo's Streaming Benchmark |
| aronszanto/sLSM-Tree |
29 |
|
0 |
0 |
over 8 years ago |
0 |
|
0 |
gpl-3.0 |
C++ |
| High-Performance C++ Data System |
| openucx/sparkucx |
23 |
|
0 |
0 |
over 4 years ago |
0 |
|
6 |
bsd-3-clause |
Scala |
| A high-performance, scalable and efficient ShuffleManager plugin for Apache Spark, utilizing UCX communication layer |
| Databeans/lighthouse |
8 |
|
0 |
0 |
over 2 years ago |
1 |
May 31, 2023 |
0 |
mit |
Scala |
| Shed light on your data layout in order to monitor the health of your Lakehouse tables and identify when data maintenance operations should be performed. |