Removing duplicate data while keeping one of every set

Nate · Jan 29, 2009

I have a client list of 60,000 with aprox. 9,000 duplicated names and
adresses. I need to remove the sets of duplicated data, while keeping one of
each set. Any help would be great.

Thank you

Ken Sheridan · Jan 29, 2009

Firstly, if you don't have one already, add an autonumber column to the table
to uniquely identify each row. Then use a query along these lines:

DELETE *
FROM Contacts AS C1
WHERE ContactID <>
(SELECT MIN(ContactID)
FROM Contacts AS C2
WHERE C2.LastName = C1.LastName
AND C2.FirstName = C1.FirstName
AND C2.AddressLine1 = C1.AddressLine1);

where ContactID is the autonumber column.

The above example assumes that duplication is identified by rows having the
same values in the LastName, FirstName and AddressLine1 columns, but you'll
be able to amend it easily if the basis of duplication is some other set of
columns. It also assumes of course that the values in these columns are
*exactly* the same.

AS always with this sort of operation its imperative that the table be
backed-up first of course.

Ken Sheridan
Stafford, England

Arvin Meyer [MVP] · Jan 29, 2009

Try this Knowledge Base article:

http://support.microsoft.com/kb/209183

Danny Lesandrini · Jan 29, 2009

Try this link ... there are a couple of ideas out here

http://www.amazecreations.com/datafast/ShowArticle.aspx?File=Articles/deleteduplicates.htm

Larry Linson · Jan 29, 2009

Nate said:
I have a client list of 60,000 with aprox. 9,000
duplicated names and adresses. I need to remove
the sets of duplicated data, while keeping one of
each set. Any help would be great.

A problem sometimes encountered in similar situations is where you do not
have actual "duplicated" records, but multiple records for the same "entity"
(in your case, client) but with different data about the entity -- then it
is not so easy to choose which one to keep.

Larry Linson
Microsoft Office Access MVP

Access Auto Matching Duplicates?	0	Jul 26, 2017
Duplicate values	4	Sep 17, 2009
Return every other row?	8	Oct 17, 2008
Removing duplicates	3	Oct 23, 2007
Removing duplicate detail records	3	Oct 23, 2007
Duplicate values in the index, primary key, or relationship messag	2	Dec 17, 2009
duplicates error after 2 years- why?	5	Feb 4, 2010
Excel Subtotal without calculate duplicate lines	4	Feb 22, 2017

Removing duplicate data while keeping one of every set

Nate

Ken Sheridan

Arvin Meyer [MVP]

Danny Lesandrini

Larry Linson

Ask a Question

Similar Threads