Dedupe Alternatives

Java DSL for (online) deduplication
Alternatives To bakdata/dedupe
Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language
J535D165/recordlinkage 808 9 3 over 2 years ago 23 July 20, 2023 57 bsd-3-clause Python
A powerful and modular toolkit for record linkage and duplicate detection in Python
usc-isi-i2/rltk 81 1 5 over 4 years ago 20 October 06, 2021 4 mit Python
Record Linkage ToolKit (Find and link entities)
zhao1701/extending-deep-ER 18 0 0 almost 8 years ago 0 1 Jupyter Notebook
This project focuses on DeepER, a deep learning framework for entity resolution (record deduplication). It examines how DeepER performs on benchmark datasets under a variety of conditions and also tests a number of extensions designed to improve DeepER's accuracy.
gpoulter/pydedupe 16 0 0 almost 9 years ago 0 0 gpl-3.0 Python
Python dedupe library used in Mocality
bakdata/dedupe 13 0 1 over 4 years ago 7 September 28, 2021 3 mit Java
Java DSL for (online) deduplication
google/unisim 10 0 0 over 2 years ago 0 2 mit Python
UniSim is a package for efficient similarity computation, fuzzy matching, and clustering of data.
rif/imgdup 10 0 0 almost 11 years ago 6 May 08, 2015 0 mit Python
Visual similarity image finder and cleaner
linhongseba/Product-Deduplication 7 0 0 over 7 years ago 0 0 apache-2.0 HTML
A practical implementation for product deduplication using TFIDF and Super Bit LSH
jacobmarks/image-deduplication-plugin 6 0 0 over 2 years ago 0 1 Python
Remove exact and approximate duplicates from your dataset in FiftyOne!

Downloads, Dependent Repos, Dependent Packages, Total Releases, Latest Releases data powered by Libraries.io.

Copyright 2018-2026 Awesome Open Source.  All rights reserved.