| drabastomek/learningPySpark |
409 |
|
0 |
0 |
about 7 years ago |
0 |
|
2 |
gpl-3.0 |
Jupyter Notebook |
| Code base for the Learning PySpark book (in preparation) |
| microsoft/Azure-Databricks-NYC-Taxi-Workshop |
80 |
|
0 |
0 |
over 3 years ago |
0 |
|
8 |
mit |
Scala |
| An Azure Databricks workshop leveraging the New York Taxi and Limousine Commission Trip Records dataset |
| jleetutorial/python-spark-streaming |
73 |
|
0 |
0 |
about 8 years ago |
0 |
|
2 |
|
Jupyter Notebook |
| laserson/dsq |
39 |
|
0 |
0 |
about 12 years ago |
0 |
|
0 |
apache-2.0 |
Python |
| Distributed Streaming Quantiles (for PySpark) |
| kaantas/spark-twitter-sentiment-analysis |
33 |
|
0 |
0 |
over 7 years ago |
0 |
|
1 |
|
Python |
| Sentiment Analysis of a Twitter Topic with Spark Structured Streaming |
| stevenhurwitt/reddit-streaming |
18 |
|
0 |
0 |
almost 3 years ago |
0 |
|
0 |
|
Jupyter Notebook |
| streaming eight subreddits from reddit api using kafka producer & spark structured streaming. |
| kaantas/kafka-twitter-spark-streaming |
16 |
|
0 |
0 |
over 8 years ago |
0 |
|
1 |
|
Python |
| Counting Tweets Per User in Real-Time |
| AaronYang2333/DSCI_553 |
12 |
|
0 |
0 |
over 5 years ago |
0 |
|
0 |
|
ReScript |
| USC :v: 2020 Spring DSCI 553 (Foundations and Applications of Data Mining) 数据挖掘基础与应用 Score: :nine::four: |
| SignifAi/Spark-PubSub |
11 |
|
0 |
0 |
over 7 years ago |
0 |
|
2 |
apache-2.0 |
Java |
| Google Cloud Pubsub connector for Spark Streaming |
| bilal-elchami/dijkstra-hadoop-spark |
10 |
|
0 |
0 |
almost 8 years ago |
0 |
|
0 |
mit |
TeX |
| Dijkstra Algorithm - Python Hadoop Streaming and Pyspark |