| ICIJ/extract |
229 |
|
0 |
1 |
about 2 years ago |
58 |
November 13, 2023 |
10 |
mit |
Java |
| A cross-platform command line tool for parallelised content extraction and analysis. |
| dalenewman/Transformalize |
153 |
|
18 |
45 |
over 2 years ago |
49 |
April 21, 2020 |
7 |
other |
C# |
| Configurable Extract, Transform, and Load |
| agile-lab-dev/wasp |
25 |
|
0 |
15 |
over 2 years ago |
25 |
September 14, 2023 |
4 |
other |
Scala |
| WASP is a framework to build complex real time big data applications. It relies on a kind of Kappa/Lambda architecture mainly leveraging Kafka and Spark. If you need to ingest huge amount of heterogeneous data and analyze them through complex pipelines, this is the framework for you. |
| relwell/chompsky |
7 |
|
0 |
0 |
over 13 years ago |
0 |
|
0 |
|
Java |
| An NLP pipeline for Wikia data |
| linkedpipes/dcat-ap-viewer |
6 |
|
0 |
0 |
over 2 years ago |
0 |
|
26 |
mit |
JavaScript |
| Viewer of DCAT-AP 2.0.1 compatible dataset metadata |
| GuilhermeViterboGalvao/solrMongoDBDataImporter |
5 |
|
0 |
0 |
over 4 years ago |
0 |
|
0 |
|
Java |
| Solr MongoDB Data Import |