Skip to content
@dedupeio

Dedupe.io

De-duplicate and find matches in your Excel spreadsheet or database

Pinned Loading

  1. dedupe Public

    🆔 A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.

    Python 4.3k 562

  2. csvdedupe Public

    🆔 Command line tool for deduplicating CSV files

    Python 423 83

  3. dedupe-examples Public

    🆔 Examples for using the dedupe library

    Python 413 216

  4. affinegap Public

    📐 A Cython implementation of the affine gap string distance

    Cython 57 10

  5. pyhacrf Public

    Forked from dirko/pyhacrf

    📐 Hidden alignment conditional random field for classifying string pairs.

    Python 24 12

  6. doublemetaphone Public

    🔉 Python wrapper for a C++ Double Metaphone

    C++ 15 9

Repositories

Showing 10 of 32 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Most used topics

Loading…