Removing duplicate files

J

John

Hi

Is there a good utility to detect duplicate files based on content and then
to remove them?

Thanks

Regards
 
P

Pegasus [MVP]

John said:
Hi

Is there a good utility to detect duplicate files based on content and
then to remove them?

Thanks

Regards

Google is your friend. Type the following words into a Google search box,
then take your pick:

Windows duplicate file finder
 
J

John

did before coming here...there are a number of them..can't figure which one
has the features I need..even installed and tried some as well..thought
someone may have a recommendation

regards
 
S

sgopus

Duplicate files are not compared as to content, they are compared to with
file name. It would be way IBM too complicated to compare the content to find
duplicate files.
 
J

John John - MVP

???

Real duplicate file finders will find duplicate files by contents, even
if the name and creation dates are different, it would be pretty useless
to search for duplicate by filename only! Real duplicate finders have
options to allow you to select if you want to have the filename or
creation date included in the duplicate file criteria.

John
 
E

ED

Not true.
Files are compared using the checksum value.
If CRC are equal, file contents must be equal/
 
S

sgopus

Thanks, I'm updated with that.

John John - MVP said:
???

Real duplicate file finders will find duplicate files by contents, even
if the name and creation dates are different, it would be pretty useless
to search for duplicate by filename only! Real duplicate finders have
options to allow you to select if you want to have the filename or
creation date included in the duplicate file criteria.

John

.
 
H

HeyBub

ED said:
Not true.
Files are compared using the checksum value.
If CRC are equal, file contents must be equal/

Almost certainly that's true, but not always.

If the check-sum is four bytes, there's a one-in-four-quadrillion chance the
files could differ but have the same check-sum.
 
P

Paul Randall

HeyBub said:
Almost certainly that's true, but not always.

If the check-sum is four bytes, there's a one-in-four-quadrillion chance
the files could differ but have the same check-sum.

Also, a malicious person could have modified the file in such a way that the
CRC is the same as it was before the modification. Other checksum
mechanism, such as MD5 were designed to be more difficult to spoof in this
way, but it turned out to be not that much more difficult. To be really
sure that the content of two files is identical, one might verify that the
CRC, MD5, and SHA checksums are all identical for the two files.

-Paul Randall
 
B

Bob I

Paul said:
Also, a malicious person could have modified the file in such a way that the
CRC is the same as it was before the modification. Other checksum
mechanism, such as MD5 were designed to be more difficult to spoof in this
way, but it turned out to be not that much more difficult. To be really
sure that the content of two files is identical, one might verify that the
CRC, MD5, and SHA checksums are all identical for the two files.

-Paul Randall

See FC with the /b switch in Windows Help.
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Top