Jump to content
Server Maintenance This Week. ×

Techniques for Merging Duplicates


xochi

This topic is 6489 days old. Please don't post here. Open a new topic instead.

Recommended Posts

I have a database with a non-trivial number of records which are non-identical duplicates, but should be combined or cleaned up in some way. For example, a record for "John Johnson 11110043" and "John Q Johnson 11120043" that turn out, upon further examination, to be the same person, mistakenly entered twice with an error in their primary ID # and a missing middle name.

My question is, are there any "tried & true" ways of dealing with this. I've thought about some different ways:

  • Combine the relevant data from record A and record B into a new record C, with a new ID number. Mark A and B as "obsolete" or "unused".
  • Combine the relevant data from record A and record B into a new record C with a new ID number. Delete A and B.
  • Combine the relevant data from record A and record B into a new record C. Export A and B to a "deleted records" table, then delete A and B.
  • Combine the relevant data from record A and record B into the existing record B (same ID number). Delete A.
  • Etc.

I'm torn between the ideas of keeping all old data forever (even bad, duplicate data), and cleaning & pruning the data so that the current set is as "pure" as possible.

It seems like this is a common problem that others have dealt with. Any advice?

Link to comment
Share on other sites

To clarify: My question is not about Detecting the duplicate records (that problem, though difficult, is basically solved). My question is about what to do with the duplicates once they are discovered.

Added complexity: Both Record A and Record B may have related employment data that should be saved i the merge process. (e.g. a person works for a while as John Johnson, leaves for a bit, then returns and works under the new name / ID. The merged record ideally should include their entire work history from both records.

Link to comment
Share on other sites

This topic is 6489 days old. Please don't post here. Open a new topic instead.

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now
×
×
  • Create New...

Important Information

By using this site, you agree to our Terms of Use.