| chiphuyen/sotawhat |
1,280 |
|
0 |
0 |
over 2 years ago |
0 |
|
18 |
|
Python |
| Returns latest research results by crawling arxiv papers and summarizing abstracts. Helps you stay afloat with so many new papers everyday. |
| jonatasgrosman/findpapers |
164 |
|
0 |
0 |
about 2 years ago |
24 |
June 22, 2021 |
3 |
mit |
Python |
| Findpapers: A tool for helping researchers who are looking for related works |
| complementizer/wcep-mds-dataset |
49 |
|
0 |
0 |
over 3 years ago |
0 |
|
0 |
mit |
Python |
| englehardt/cookies-that-give-you-away |
44 |
|
0 |
0 |
over 7 years ago |
0 |
|
0 |
|
OpenEdge ABL |
| Code release for: Cookies that give you away: The surveillance implications of web tracking |
| aruneshmathur/dark-patterns |
30 |
|
0 |
0 |
over 6 years ago |
0 |
|
0 |
gpl-3.0 |
Jupyter Notebook |
| Code and data belonging to our CSCW 2019 paper: "Dark Patterns at Scale: Findings from a Crawl of 11K Shopping Websites". |
| ChanChiChoi/tiny-crawler |
21 |
|
0 |
0 |
over 2 years ago |
0 |
|
0 |
|
Python |
| download the links from libgen.io, arxiv |
| joelthchao/arxiv-crawler |
17 |
|
0 |
0 |
over 8 years ago |
0 |
|
0 |
|
Python |
| crawling arXiv paper and organize as a database |
| WING-NUS/Kairos |
17 |
|
0 |
0 |
about 15 years ago |
0 |
|
1 |
apache-2.0 |
Java |
| Kairos, combines a focused crawler and an information extraction engine, to convert a list of conference websites into a index filled with fields of metadata that correspond to individual papers. Using event date metadata extracted from the conference website, Kairos proactively harvests metadata about the individual papers soon after they are made public. We use a Maximum Entropy classifier to classify uniform resource locators (URLs) as scientific conference websites and use Conditional Random Fields (CRF) to extract individual paper metadata from such websites. The crawler is built on top of the popular open-source crawler Nutch. |
| JustJokerX/PaperCrawler |
16 |
|
0 |
0 |
over 7 years ago |
0 |
|
0 |
gpl-3.0 |
Python |
| Crawler used to crawl papers |
| yagol2020/PaperWebCrawler |
15 |
|
0 |
0 |
almost 3 years ago |
0 |
|
0 |
apache-2.0 |
Java |
| IEEE XPLORE等文献网站的爬虫工具/Crawler for Paper Website like IEEE XPLORE |