| Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
|---|---|---|---|---|---|---|---|---|---|---|
| mattilyra/LSH | 243 | 0 | 0 | | almost 3 years ago | 0 | | 12 | mit | Python |
| Locality Sensitive Hashing using MinHash in Python/Cython to detect near duplicate text documents | ||||||||||
| davidsvy/Neural-Scam-Artist | 15 | 0 | 0 | | over 4 years ago | 0 | | 0 | mit | Python |
| Web Scraping, Document Deduplication & GPT-2 Fine-tuning with a newly created scam dataset. | ||||||||||
| zyocum/dedup | 10 | 0 | 0 | | about 3 years ago | 0 | | 0 | mit | Python |
| Find duplicate text files. | ||||||||||
| linhongseba/Product-Deduplication | 7 | 0 | 0 | | over 7 years ago | 0 | | 0 | apache-2.0 | HTML |
| A practical implementation for product deduplication using TFIDF and Super Bit LSH | ||||||||||
| chr1st1ank/narrow-down | 6 | 0 | 0 | | almost 3 years ago | 18 | May 01, 2023 | 10 | apache-2.0 | Python |
| Fast fuzzy text search | ||||||||||