G
Guest
We have an Access 2000 Database with Duplicate records. These records have
unique identifiers for the individuals (person or company.
The database is a registry, where individuals can register different
products. The duplicates have typically been caused by incorrect data entry
or re-entering related information for the same client for a new
product,without first searching for the existing client information (e.g.
Address).
We need to be able to identify these records, Link the information and then
delete the duplicate.
We have hypothesized on various ways the duplicate data was caused and have
come up with queries for identifying most of the records. We are however
battling with queries or scripts to identify (& extract or Group) duplicate
records the following nature
1. Duplicates caused, where the Name is exactly the same, but there
are slight differences in the address field
ID-PERSON NAME ADDRESS
1092260 MR GERNTZ, 45 ST. BARRY ROAD, DURBAN, 3423.
1094784 MR GERNTZ, 45 BARRY ROAD, DURBAN, 3423.
2. Duplicates caused, where the Address is exactly the same, but there
are slight differences in the Name field
ID-PERSON NAME ADDRESS
932 BITE-SOFT C.C. 8 HILL TERACE
MILNERTON 7441
933 BITESOFT C.C. 8 HILL TERACE
MILNERTON 7441
3. Duplicates caused, where the Name is exactly the same, but in one
record the address field is a Postal address, while in the other record the
address is a physical address.
ID-PERSON NAME ADDRESS
2345 MICHEAL ENGELS KIGELIA STR 53 FLORA
PARKBENONI0760
2087 MICHEAL ENGELS P O BOX 5852 BENONI 0760
4. Duplicates caused, where the Address is exactly the same, but in one
record within the Name field abbreviations are used and not used in the Name
field of the other record.
ID-PERSON NAME ADDRESS
3445 MADUMAS (PROPRIETARY) LIMITED P O BOX 6052 CAPE TOWN0360
3487 MADUMAS (PTY) LTD P O BOX 6052 CAPE
TOWN 0360
unique identifiers for the individuals (person or company.
The database is a registry, where individuals can register different
products. The duplicates have typically been caused by incorrect data entry
or re-entering related information for the same client for a new
product,without first searching for the existing client information (e.g.
Address).
We need to be able to identify these records, Link the information and then
delete the duplicate.
We have hypothesized on various ways the duplicate data was caused and have
come up with queries for identifying most of the records. We are however
battling with queries or scripts to identify (& extract or Group) duplicate
records the following nature
1. Duplicates caused, where the Name is exactly the same, but there
are slight differences in the address field
ID-PERSON NAME ADDRESS
1092260 MR GERNTZ, 45 ST. BARRY ROAD, DURBAN, 3423.
1094784 MR GERNTZ, 45 BARRY ROAD, DURBAN, 3423.
2. Duplicates caused, where the Address is exactly the same, but there
are slight differences in the Name field
ID-PERSON NAME ADDRESS
932 BITE-SOFT C.C. 8 HILL TERACE
MILNERTON 7441
933 BITESOFT C.C. 8 HILL TERACE
MILNERTON 7441
3. Duplicates caused, where the Name is exactly the same, but in one
record the address field is a Postal address, while in the other record the
address is a physical address.
ID-PERSON NAME ADDRESS
2345 MICHEAL ENGELS KIGELIA STR 53 FLORA
PARKBENONI0760
2087 MICHEAL ENGELS P O BOX 5852 BENONI 0760
4. Duplicates caused, where the Address is exactly the same, but in one
record within the Name field abbreviations are used and not used in the Name
field of the other record.
ID-PERSON NAME ADDRESS
3445 MADUMAS (PROPRIETARY) LIMITED P O BOX 6052 CAPE TOWN0360
3487 MADUMAS (PTY) LTD P O BOX 6052 CAPE
TOWN 0360