| moj-analytical-services/splink |
939 |
|
0 |
2 |
about 2 years ago |
119 |
November 14, 2023 |
167 |
mit |
Python |
| Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends |
| zouzias/spark-lucenerdd |
127 |
|
0 |
0 |
over 2 years ago |
39 |
June 02, 2021 |
36 |
apache-2.0 |
Scala |
| Spark RDD with Lucene's query and entity linkage capabilities |
| cleanzr/dblink |
38 |
|
0 |
0 |
almost 5 years ago |
0 |
|
4 |
other |
Scala |
| Distributed Bayesian Entity Resolution in Apache Spark |
| ing-bank/spark-matcher |
27 |
|
0 |
0 |
over 2 years ago |
0 |
|
5 |
gpl-2.0 |
Python |
| Record matching and entity resolution at scale in Spark |
| pranab/whakapai |
22 |
|
0 |
0 |
about 2 years ago |
0 |
|
1 |
|
Python |
| Various Python Data Science Projects available in PyPi |
| zouzias/spark-lucenerdd-examples |
15 |
|
0 |
0 |
over 2 years ago |
0 |
|
2 |
apache-2.0 |
Scala |
| Examples of spark-lucenerdd |
| pierrepita/atyimo |
7 |
|
0 |
0 |
almost 7 years ago |
0 |
|
0 |
mit |
Python |
| moj-analytical-services/splink_graph |
6 |
|
0 |
0 |
about 3 years ago |
41 |
March 14, 2022 |
8 |
mit |
HTML |
| pyspark-parallelised functions producing graph-theoretical metrics in connected component clusters for use in record-linkage (or other domains) |