| ferventdesert/Hawk |
2,638 |
|
0 |
0 |
over 6 years ago |
0 |
|
65 |
apache-2.0 |
C# |
| visualized crawler & ETL IDE written with C#/WPF |
| ferventdesert/etlpy |
393 |
|
0 |
0 |
over 6 years ago |
0 |
|
8 |
apache-2.0 |
Python |
| a smart stream-like crawler & etl python library |
| wx-chevalier/sentinel-crawler |
122 |
|
0 |
2 |
almost 3 years ago |
20 |
July 07, 2017 |
33 |
mit |
JavaScript |
| Xenomorph Crawler, a Concise, Declarative and Observable Distributed Crawler(Node / Go / Java / Rust) For Web, RDB, OS, also can act as a Monitor(with Prometheus) or ETL for Infrastructure :dizzy: 多语言执行器,分布式爬虫 |
| dalenewman/SQLoogle |
21 |
|
0 |
0 |
over 7 years ago |
0 |
|
0 |
apache-2.0 |
C# |
| Crawl, Index, and Search Your SQL. |
| awslabs/amazon-s3-step-functions-ingestion-orchestration |
19 |
|
0 |
0 |
over 6 years ago |
0 |
|
0 |
apache-2.0 |
Python |
| Design pattern for orchestrating an incremental data ingestion pipeline using AWS Step Functions from an on premise location into an Amazon S3 datalake bucket |
| openeduhub/oeh-search-etl |
7 |
|
0 |
0 |
over 2 years ago |
0 |
|
10 |
|
Python |
| The Backend includes all data for the ETL process (Scrapy, Postgres, Elasticsearch) |
| LuQQiu/NutchPigHive |
5 |
|
0 |
0 |
about 9 years ago |
0 |
|
0 |
|
Java |
| crawl GooglePlay data with Nutch, ETL with Pig, analyze with Hive |