G
Guest
Hi,
I am trying to write a query which will spot duplicates. There are times
when the shipping department gets a duplicate sales receipt. The data source
is a csv file that UPS Worldship creates for processing packages. There is a
limitation to the data. What it gives me is ReferenceNumber which we use to
place a sales receipt number, CustomerName, Address, City, State, Zip and
NumberOfPackages. The Customer name is selected from a drop down and it
fills in the rest of the address info. The number of packages is numeric and
usually 1 but sometimes 2 or 3. The problem is that when there are two
packages being sent to someone, it will list the record twice. So the data
for this record would look like this:
ReferenceNumber CustomerName Address City State Zip
NumberPgk
232323 Joe Smith 111 Main Street Atlanta GA
30303 2
232323 Joe Smith 111 Main Street Atlanta GA
30303 2
for all of the shipments that only have one package, there is one record
with number of packages being 1. If there had been a duplicate with
shipments of one package then I could run a cross tab and spot records that
appear twice but because the shipments that have 2 or more packages have
duplicate (or triplicate) info, the cross tab would pick this up as a false
duplicate. If the above record to Joe Smith had been a duplicate, there
would be four records rather than two.
Can someone help?
Thanks,
I am trying to write a query which will spot duplicates. There are times
when the shipping department gets a duplicate sales receipt. The data source
is a csv file that UPS Worldship creates for processing packages. There is a
limitation to the data. What it gives me is ReferenceNumber which we use to
place a sales receipt number, CustomerName, Address, City, State, Zip and
NumberOfPackages. The Customer name is selected from a drop down and it
fills in the rest of the address info. The number of packages is numeric and
usually 1 but sometimes 2 or 3. The problem is that when there are two
packages being sent to someone, it will list the record twice. So the data
for this record would look like this:
ReferenceNumber CustomerName Address City State Zip
NumberPgk
232323 Joe Smith 111 Main Street Atlanta GA
30303 2
232323 Joe Smith 111 Main Street Atlanta GA
30303 2
for all of the shipments that only have one package, there is one record
with number of packages being 1. If there had been a duplicate with
shipments of one package then I could run a cross tab and spot records that
appear twice but because the shipments that have 2 or more packages have
duplicate (or triplicate) info, the cross tab would pick this up as a false
duplicate. If the above record to Joe Smith had been a duplicate, there
would be four records rather than two.
Can someone help?
Thanks,