Get rid of "Duplicate Records" in a table

ali · Mar 5, 2008

I have a table: (Dummy Scenario)

-Auto_number
-Employee_ID
-Name
-Join_Date
-Employee_Type
-Position

--------------------------------------------------------------------------------------
Problem:

Due to some importation reasons.... some entries are duplicated. They have
exactly same data but they occupy more than 1 row, maybe 2 or three

--------------------------------------------------------------------------------------
I want to:

1) Retrieve those duplicated entries.
2) Make all entries in my table unique by getting rid off them.

Dear experts, thanks a lot !

Allen Browne · Mar 5, 2008

Use a subquery to see if there is a "duplicate". This kind of thing:

DELETE FROM Table1
WHERE [Auto_number] >
(SELECT Min([Auto_number] AS MinID
FROM Table1 AS Dupe
WHERE Dupe.[Employee_ID] = Table1.[Employee_ID]
AND Dupe.[Name] = Table1.[Name]
AND ...
AND Dupe.[Postion] = Table1.[Position]);

The core idea here is to retain the lowest Auto_number value. If there are
other Auto_number values that are higher than the lowest one and matches on
all the fields that define "duplicate", then dump those records. The WHERE
clause of the subquery will need to contain all the fields that define
"duplicate" for you.

If subqueries are new, here's an introduction:
http://allenbrowne.com/subquery-01.html

You stated that this is a dummy scenario, but I'll just mention that you
could run into problems using Name and Position as field names. Here's a
list of the names to avoid:
http://allenbrowne.com/AppIssueBadWord.html

ali · Mar 10, 2008

Great method !, it works ! and big thanks for the tips !

However,

I have another table without "primary keys" (i know it goes against DB
design therory!, but it just happens !),

If i do not have "primary key", how can i get rid of duplicate data ?

Thanks a lot allen !

--
Allen Phailat Wongakanit

Allen Browne said:
Use a subquery to see if there is a "duplicate". This kind of thing:

DELETE FROM Table1
WHERE [Auto_number] >
(SELECT Min([Auto_number] AS MinID
FROM Table1 AS Dupe
WHERE Dupe.[Employee_ID] = Table1.[Employee_ID]
AND Dupe.[Name] = Table1.[Name]
AND ...
AND Dupe.[Postion] = Table1.[Position]);

The core idea here is to retain the lowest Auto_number value. If there are
other Auto_number values that are higher than the lowest one and matches on
all the fields that define "duplicate", then dump those records. The WHERE
clause of the subquery will need to contain all the fields that define
"duplicate" for you.

If subqueries are new, here's an introduction:
http://allenbrowne.com/subquery-01.html

You stated that this is a dummy scenario, but I'll just mention that you
could run into problems using Name and Position as field names. Here's a
list of the names to avoid:
http://allenbrowne.com/AppIssueBadWord.html

--
Allen Browne - Microsoft MVP. Perth, Western Australia

Reply to group, rather than allenbrowne at mvps dot org.

ali said:

I have a table: (Dummy Scenario)

-Auto_number
-Employee_ID
-Name
-Join_Date
-Employee_Type
-Position

--------------------------------------------------------------------------------------
Problem:

Due to some importation reasons.... some entries are duplicated. They have
exactly same data but they occupy more than 1 row, maybe 2 or three

--------------------------------------------------------------------------------------
I want to:

1) Retrieve those duplicated entries.
2) Make all entries in my table unique by getting rid off them.

Dear experts, thanks a lot !

Click to expand...

Allen Browne · Mar 10, 2008

Add a primary key

If there is no way to distingish between duplicates, there is no way to
instruct Access which one to delete.

Add a primary key.

If you don't want to do that, create a query that deduplicates the data
(GROUP BY), and turn that into a Make Table query so you end up with a table
that is de-duplicated.

--
Allen Browne - Microsoft MVP. Perth, Western Australia

Reply to group, rather than allenbrowne at mvps dot org.

ali said:
Great method !, it works ! and big thanks for the tips !

However,

I have another table without "primary keys" (i know it goes against DB
design therory!, but it just happens !),

If i do not have "primary key", how can i get rid of duplicate data ?

Thanks a lot allen !

--
Allen Phailat Wongakanit

Allen Browne said:

Use a subquery to see if there is a "duplicate". This kind of thing:

DELETE FROM Table1
WHERE [Auto_number] >
(SELECT Min([Auto_number] AS MinID
FROM Table1 AS Dupe
WHERE Dupe.[Employee_ID] = Table1.[Employee_ID]
AND Dupe.[Name] = Table1.[Name]
AND ...
AND Dupe.[Postion] = Table1.[Position]);

The core idea here is to retain the lowest Auto_number value. If there
are
other Auto_number values that are higher than the lowest one and matches
on
all the fields that define "duplicate", then dump those records. The
WHERE
clause of the subquery will need to contain all the fields that define
"duplicate" for you.

If subqueries are new, here's an introduction:
http://allenbrowne.com/subquery-01.html

You stated that this is a dummy scenario, but I'll just mention that you
could run into problems using Name and Position as field names. Here's a
list of the names to avoid:
http://allenbrowne.com/AppIssueBadWord.html

ali said:

I have a table: (Dummy Scenario)

-Auto_number
-Employee_ID
-Name
-Join_Date
-Employee_Type
-Position

--------------------------------------------------------------------------------------
Problem:

Due to some importation reasons.... some entries are duplicated. They
have
exactly same data but they occupy more than 1 row, maybe 2 or three

--------------------------------------------------------------------------------------
I want to:

1) Retrieve those duplicated entries.
2) Make all entries in my table unique by getting rid off them.

Dear experts, thanks a lot !

Click to expand...

Click to expand...

Duplicate Entries	4	Feb 5, 2007
Need a query to get rid of duplicates using two tables	1	Dec 11, 2009
Deleting duplicate records/same table	7	May 6, 2008
Finding a count of duplicates	4	Jul 29, 2009
Deleting Duplicate Records in a Query	5	Mar 18, 2009
Duplicate value & delete	8	Apr 8, 2010
retrieving duplicate records from multiple tables	1	Jun 22, 2009
return all entries where name contains "LEN"	1	Mar 5, 2008

Get rid of "Duplicate Records" in a table

ali

Allen Browne

ali

Allen Browne

Ask a Question

Similar Threads