| crawler-commons/crawler-commons | 217 | | 26 | 7 | over 2 years ago | 10 | July 13, 2023 | 30 | apache-2.0 | Java | A set of reusable Java components that implement functionality common to any web crawler |
| mediacloud/ultimate-sitemap-parser | 76 | | 0 | 1 | about 5 years ago | 5 | July 31, 2019 | 8 | other | Python | Ultimate Website Sitemap Parser |
| VIPnytt/SitemapParser | 62 | | 3 | 6 | over 2 years ago | 15 | November 27, 2023 | 0 | mit | PHP | XML Sitemap parser class compliant with the Sitemaps.org protocol. |
| snabb/sitemap | 39 | | 2 | 15 | about 3 years ago | 5 | February 24, 2023 | 1 | mit | Go | Go XML sitemap and sitemapindex package (golang) |
| andreisavu/python-sitemap | 33 | | 0 | 0 | almost 12 years ago | 0 | | 3 | apache-2.0 | Python | Python library for parsing & generating sitemaps |
| oxffaa/gopher-parse-sitemap | 28 | | 0 | 0 | about 3 years ago | 0 | | 4 | mit | Go | A high effective golang library for parsing big-sized sitemaps and avoiding high memory usage. The sitemap parser was written on golang without external dependencies. |
| evanderkoogh/node-sitemap-stream-parser | 26 | | 4 | 9 | almost 6 years ago | 15 | February 21, 2019 | 14 | apache-2.0 | CoffeeScript | A streaming parser for sitemap files. Is able to deal with deeply nested sitemaps with 100+ million urls in them. |
| benbalter/sitemap-parser | 26 | | 8 | 4 | about 2 years ago | 10 | December 14, 2021 | 7 | mit | Ruby | Ruby Gem to parse sitemaps.org compliant sitemaps |
| TurnerSoftware/SitemapTools | 23 | | 1 | 1 | over 2 years ago | 9 | August 10, 2022 | 2 | mit | C# | A sitemap (sitemap.xml) querying and parsing library for .NET |
| VIPnytt/RobotsTxtParser | 21 | | 4 | 5 | about 5 years ago | 12 | April 10, 2021 | 0 | mit | PHP | An extensible robots.txt parser and client library, with full support for every directive and specification. |