| openvenues/libpostal | 3,897 | | 0 | 0 | about 2 years ago | 0 | | 315 | mit | C | A C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data. |
| dedupeio/dedupe | 3,879 | | 39 | 10 | over 2 years ago | 174 | February 17, 2023 | 72 | mit | Python | A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution. |
| moj-analytical-services/splink | 939 | | 0 | 2 | about 2 years ago | 119 | November 14, 2023 | 167 | mit | Python | Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends |
| J535D165/recordlinkage | 808 | | 9 | 3 | over 2 years ago | 23 | July 20, 2023 | 57 | bsd-3-clause | Python | A powerful and modular toolkit for record linkage and duplicate detection in Python |
| Yomguithereal/talisman | 666 | | 1,135 | 48 | about 3 years ago | 30 | January 21, 2021 | 80 | mit | JavaScript | Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript. |
| dedupeio/csvdedupe | 393 | | 0 | 0 | about 6 years ago | 0 | | 21 | other | Python | Command line tool for deduplicating CSV files |
| J535D165/data-matching-software | 329 | | 0 | 0 | over 2 years ago | 0 | | 8 | | | A list of free data matching and record linkage software. |
| dedupeio/dedupe-examples | 306 | | 0 | 0 | about 4 years ago | 0 | | 7 | mit | Python | Examples for using the dedupe library |
| zouzias/spark-lucenerdd | 127 | | 0 | 0 | about 2 years ago | 39 | June 02, 2021 | 36 | apache-2.0 | Scala | Spark RDD with Lucene's query and entity linkage capabilities |
| vintasoftware/entity-embed | 98 | | 0 | 0 | almost 4 years ago | 6 | July 16, 2021 | 0 | mit | Jupyter Notebook | PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolution using Approximate Nearest Neighbors. |