Identifying Duplicates in Access ?

Davie · Jan 28, 2006

Lets assume I have a table with name, address, postcode in it.

id | name | address1 | address2 | postcode

2 jimbo 3 highstreet london lh34 4tn
45 jimbo 3 high street london lh43 4tn

Assume these are two duplicate users, albeit their address is different,
what is the easiest way of determining they are duplicates?

Guest · Jan 28, 2006

Hi Davie,

The easiest way is to use the Find Duplicates Query Wizard under New Query
in the Database window.
Regards.

Kjell Harnesk · Jan 28, 2006

Lets assume I have a table with name, address, postcode in it.

id | name | address1 | address2 | postcode

2 jimbo 3 highstreet london lh34 4tn
45 jimbo 3 high street london lh43 4tn

Assume these are two duplicate users, albeit their address is different,
what is the easiest way of determining they are duplicates?

Two people with same name but with different addresses can't be duplicates.
I think it´s illegal to clone people. In the example above the addresses
are the same and one of the records can be deleted because even if they in
fact represent two diffrent people the are the same datawise if there are
no other fields telling them apart.

John Marshall, MVP · Jan 28, 2006

Finding duplicates in name/address is not an easy task. There are a number
of issues that make an address appear different to a computer, but the
average user can spot them as being the same. You should pass the
information through a filter to remove some of the problems.
1) Mixed case -> change a copy of the data to all upper case or all lower
case.
2) Remove white space -> the fields may contain blanks or tabs within the
text or before and after.
3) Standardize on common words -> street to st, road to rd, private to pvt
etc

You can then use this temporary field (or fields) to find duplicates. You
may want to keep the fields seperate. In your example, three of the four
fields match after a cleanup. Only the postcode differs. The same would be
true of one of the address was missing the postcode. A three out of four
match may be close enough to be considered as a possible match.

Even if you do find duplicates it is possible that they are different people
living at the same address. (father /son)

John... Visio MVP

Kjell Harnesk · Jan 28, 2006

Lets assume I have a table with name, address, postcode in it.

id | name | address1 | address2 | postcode

2 jimbo 3 highstreet london lh34 4tn
45 jimbo 3 high street london lh43 4tn

Assume these are two duplicate users, albeit their address is different,
what is the easiest way of determining they are duplicates?

Sorry, I din not notice the small difference in the postcode.
But as John stated, it´s a difficult task and You can never be quite sure
without checking with the person(s) themselves.

Guest · Feb 14, 2006

What if I don't have access to the Duplicates Wizard and want to first: Get a
numerical value for the number of duplicate addresses in a table then second:
eliminate the duplicates so only a singel record for that address remains?

John Vinson · Feb 14, 2006

On Tue, 14 Feb 2006 13:31:26 -0800, "Campaign Staffer" <Campaign

What if I don't have access to the Duplicates Wizard and want to first: Get a
numerical value for the number of duplicate addresses in a table then second:
eliminate the duplicates so only a singel record for that address remains?

Create a Totals Query. Create a new query based on your table; select
the Primary Key field and whatever other fields jointly identify a
duplicate (e.g. you don't want to just use LastName to identify
duplicates because two people might have the same last name - or even
firstname, lastname and phonenumber, as in Joe Doakes and Joe Doakes
Jr.)

Group By the fields which constitute the dups, and select Count on the
totals row under the Primary Key. Put a criterion of >1 on this field
if you don't want to see singletons, only duplicates.

If there are additional fields which can legitimately be different
while still considering the record to be a duplicate, you'll need to
decide WHICH near-duplicate you want to keep. It may be simplest to
just use a continuous Form sorted by the duplicate-defining fields and
manually delete; or you can use a Delete Query using the duplicates
query as a subquery. This can be a bit tricky to set up and have it be
updateable; more details of your table structure and criteria for
deletion would be helpful!

John W. Vinson[MVP]

Concatenate part of two textboxes and then save the result	5	Mar 3, 2006
Find duplicates in address register	2	Mar 30, 2009
Populating Address fields in form	3	Jan 20, 2008
Remove duplicates in Access 2003 database	3	Mar 11, 2008
unbound txtboxes updating table	1	Oct 12, 2004
Generation of Queries to identify duplicates records in Access Dat	2	Apr 21, 2006
Duplicates in excel that aren't 100% DUPLICATES ...	4	May 1, 2008
Merging duplicate entries in Excel or deleting the duplicates (Exc	1	Apr 2, 2009

Identifying Duplicates in Access ?

Davie

Guest

Kjell Harnesk

John Marshall, MVP

Kjell Harnesk

Guest

John Vinson

Ask a Question

Similar Threads