| titipata/pubmed_parser |
431 |
|
0 |
1 |
about 3 years ago |
2 |
November 22, 2021 |
14 |
mit |
Python |
| :clipboard: A Python Parser for PubMed Open-Access XML Subset and MEDLINE XML Dataset |
| diegoceccarelli/json-wikipedia |
244 |
|
0 |
0 |
over 4 years ago |
0 |
|
6 |
apache-2.0 |
Java |
| Json Wikipedia, contains code to convert the Wikipedia xml dump into a json/avro dump |
| spencermountain/dumpster-dive |
214 |
|
1 |
2 |
almost 3 years ago |
34 |
July 04, 2023 |
8 |
other |
JavaScript |
| roll a wikipedia dump into mongo |
| soshial/xdxf_makedict |
211 |
|
0 |
0 |
about 3 years ago |
0 |
|
11 |
|
|
| XDXF — an open and free dictionary format, that stores word articles in a structural and semantic way. The most convertible format |
| jodaiber/Annotated-WikiExtractor |
88 |
|
0 |
0 |
about 15 years ago |
0 |
|
0 |
gpl-3.0 |
Python |
| Simple Wikipedia plain text extractor with article link annotations and Hadoop support. |
| 10up/eight-day-week |
82 |
|
0 |
0 |
over 2 years ago |
0 |
|
7 |
gpl-2.0 |
PHP |
| Optimize print publication workflows by using WordPress as your print CMS. |
| PLOS/allofplos |
53 |
|
0 |
0 |
over 2 years ago |
21 |
December 06, 2022 |
35 |
mit |
Python |
| Repository for the allofplos project. |
| Vitaliy-1/JATSParserPlugin |
25 |
|
0 |
0 |
almost 3 years ago |
0 |
|
35 |
gpl-3.0 |
PHP |
| OJS3 Plugin for parsing JATS XML and displaying it on article detail page |
| macbre/mediawiki-dump |
19 |
|
0 |
0 |
over 2 years ago |
0 |
|
5 |
mit |
Python |
| Python package for working with MediaWiki XML content dumps |
| joaoventura/WikiCorpusExtractor |
19 |
|
0 |
0 |
over 11 years ago |
0 |
|
0 |
|
Python |
| Extracts text from WikiMedia XML Dump files |