dupe file finder?

S

Spoon2001

Looking for a good dupe file finder. Here some that I've tried:

Dupeless - PCMag, not freeware anymore. Can't sort the results by size,
unless I use size as a criterion for identifying duplicate files. I really
want to be able to sort by size so that I can wipe out the bigger dupes
first (even if the dupes aren't the same size).

CloneSpy - gives me one set of duplicates at a time. I prefer a table list
of duplicate files like I get in Dupeless.

DupeLocater - taking forever to run - no ability to customize how files are
compared by different criteria (name, size, date, CRC, etc)
 
O

omega

Spoon2001 said:
Looking for a good dupe file finder. Here some that I've tried:

Possibly "Find Duplicates"?
http://www.david-taylor.myby.co.uk/software/disk.html#FindDuplicates
Dupeless - PCMag, not freeware anymore. Can't sort the results by size,
unless I use size as a criterion for identifying duplicate files. I really
want to be able to sort by size so that I can wipe out the bigger dupes
first (even if the dupes aren't the same size).

Find Duplicates sorts by size, biggest first. You can't sort any other way.
Yet if that's the sort order you want anyway, then that part should be fine.

Second limit with Find Duplicates. All you can do with the list it gives you
is click properties, or click the delete key, on what you select. No other
functions (eg open containing folder, save list, etc). Yet deletion is all
you've spoken of here.
CloneSpy - gives me one set of duplicates at a time. I prefer a table list
of duplicate files like I get in Dupeless.

Find Duplicates gives them out like Dupeless. The columns have size, date,
attributes, file version, product version. (The version info is something
I value.)

Something you don't have is auto-select options, or even the manual selection
checkboxes for deleting, as you do with Dupeless. In Find Duplicates, you
have to go through each file in the list one at a time, hitting the delete
key.
DupeLocater - taking forever to run -

Find Duplicates is massively slow, and heavy, a real elephant. It does a
contents compare that can't be turned off, part of the slowdown...
no ability to customize how files are
compared by different criteria (name, size, date, CRC, etc)

Find Duplicates options are on or off for: "Require same name; require
same date/time." Also in there is "require same size," but I can't say I
understand that setting, given that the program identifies duplicates by
same content. I've just opened the readme file, and see some stuff about
checksum that I've not dealt with, or even up to studying right now, so
I'll just paste the contents about that, let you see what you think from
there.

<quote Find Duplicates readme>

BASIC USAGE

Extract FindDupl.exe from the archive to a folder of your choice and run it.


HOW DOES Find Duplicates WORK?

Find Duplicates scans one or more disks on your system to find multiple
files, in a two-phase process. First it scans all the folders and sorts
all the files it finds into size order (files HAVE to be the same size to
be identical - yes?) You can limit the scan to one folder tree, if you wish.
It then compares files of the same size to see if the contents are actually
identical, and lists identical files by size order. You can then double-
click on any file to examine its properties, and optionally move it to the
recycle bin.


HOW IS THE PROCESS SPEEDED UP?

This process can take some time, so Find Duplicates will first perform one of
two preliminary checks to see if the files might actually be identical
without having to actually examine the whole file. By default, it checks the
modification date and time of the files, and only compares the files byte-by-
byte if the timestamps are the same. But it is possible for two files to
have the same contents without having the same timestamp, so you can enable
an option whereby the first 512 bytes of each file are checksummed. This
improves the recognition of identical files, but it is slower, and since it
involves a file access, the file's last access date will be altered.

By default, the timestamp, not the checksum comparison is selected. In
either case, the filename is normally ignored, so simply renaming a file will
not hide the fact that it is a duplicate. The timestamp of zero size files
is ignored. If you wish, you can also require that duplicate files must have
the same file name. You may be rather surprised to discover what duplicates
by content actually exist in some popular office suites!

You can turn off the timestamp checking in favour of the slower checksum
method should you so wish. For example, if many identically sized and
timestamped files are found from the initial search, using just timestamps
might miss some duplicate files since the duplicates may not be adjacent in
the name ordered list produced by the folder scan. The program was not
designed for this sort of duplicate search, but will perform adequately with
timestamp checking turned off. You might also wish to disable timestamp
checking if you suspected that different products had installed identical
support DLLs.

</quote Find Duplicates readme>
 
S

Son Of Spy

Spoon2001 said:
Looking for a good dupe file finder. Here some that I've tried:

Dupeless - PCMag, not freeware anymore. Can't sort the results by size,
unless I use size as a criterion for identifying duplicate files. I really
want to be able to sort by size so that I can wipe out the bigger dupes
first (even if the dupes aren't the same size).

CloneSpy - gives me one set of duplicates at a time. I prefer a table list
of duplicate files like I get in Dupeless.

DupeLocater - taking forever to run - no ability to customize how files are
compared by different criteria (name, size, date, CRC, etc)
EasyCleaner
1.7 or 2.0beta
http://www.toniarts.com/files/EClea1_7.exe
http://www.toniarts.com/files/EClea2_0.exe

Registry cleaner, duplicate detector, Pricelessware.

'Nuff said.

Son Of Spy
--

Read the latest in the Scumsucking SPCA saga HERE:
http://www.sover.net/~wysiwygx/SPCAScum.html

http://www.sover.net/~wysiwygx/index.html
. --- . . - - - - - - - - - - - -
/ SOS \ __ / Freeware - - - - - -
/ / \ ( ) / - - - - -
/ / / / / / / \/ \ - - - -
/ / / / / / / : : - - -
/ / / / / ' ' - -
/ / //..\\
=====UU==UU=====
'///||\\\'
' '' '
 
O

omega

omega said:

Next possibility is Messcleaner, its dupes finder component.
http://www.fortunecity.com/skyscraper/jobbs/79/index.html#Messy


[SORT BY SIZE]
Find Duplicates sorts by size, biggest first. You can't sort any other way.
Yet if that's the sort order you want anyway, then that part should be fine.

Messcleaner flunks on sort by size. It does alphabetical only, no options.
I'm bringing it up regardless, just as I did with Find Duplicates, because
there are so few freeware dupes finder* programs.

(*Note that I'm not including directory synchronizers here, in saying
small number; just dupes finders, where you point at a path or set
of paths.)


[DO WHAT WITH RESULTS?]
Second limit with Find Duplicates. All you can do with the list it gives you
is click properties, or click the delete key, on what you select. No other
functions (eg open containing folder, save list, etc). Yet deletion is all
you've spoken of here.

Messcleaner gives you good options on what to do with files you select from
the list. Choices in the area of Delete, Move, and Save List. What it does
lack is an auto-select feature; you have to go through each dupe group and
decide manually.


[RESULTS IN TABLE DISPLAY]
Find Duplicates gives them out like Dupeless. The columns have size, date,
attributes, file version, product version. (The version info is something
I value.)

Something you don't have is auto-select options, or even the manual selection
checkboxes for deleting, as you do with Dupeless. In Find Duplicates, you
have to go through each file in the list one at a time, hitting the delete
key.

Messcleaner gives a table. Columns are name, size, date (no version info).
Again, it does lack sorting.

Messcleaner gives the selection checkboxes that you have with Dupeless. Then
you can operate on everything you've checked at once, to move it, or delete
it, or save as list.


[SPEED]
Find Duplicates is massively slow, and heavy, a real elephant. It does a
contents compare that can't be turned off, part of the slowdown...

MessCleaner does its work pretty fast. That is, once you've hit scan.

The slow part is the way you have to hassle with its interface. You have to
launch it and let it live in the tray...and then from there, click the "Find
Duplicates" entry from the tray icon menu.


[CUSTOMIZE COMPARE CRITERIA]
Find Duplicates options are on or off for: "Require same name; require
same date/time."

Messcleaner choices: Name; Size; Date; Content.

Btw, it also has an "exclude from compare" feature, by filename or filename
pattern.


[MULTIPLE PATHS]

Best feature of Messcleaner. Multiple paths at a time, instead of just one
directory. As you get with Clonespy.
 
O

omega

omega said:
Next possibility is Messcleaner, its dupes finder component.
http://www.fortunecity.com/skyscraper/jobbs/79/index.html#Messy

Another to consider. Jv16 Power Tools (Pricelessware.org)

[SORT BY SIZE]
Find Duplicates sorts by size, biggest first.
Messcleaner flunks on sort by size.

Jv16 sorts by size. Smallest to largest, but you could just start then at the
bottom of the list.

[DO WHAT WITH RESULTS?]
Second limit with Find Duplicates. All you can do with the list it gives you
is click properties, or click the delete key, on what you select.

Messcleaner gives you good options on what to do with files you select from
the list. Choices in the area of Delete, Move, and Save List.

Jv16: Remove, Recycle Bin, Secure Wipe; or Save List.

[RESULTS IN TABLE DISPLAY]
Find Duplicates gives them out like Dupeless. The columns have size, date,
attributes, file version, product version.

Messcleaner gives a table. Columns are name, size, date (no version info).

Jv16 won't make you happy here. The dupes are grouped, tree style. No
information given on the files. It's sparse as can be.

[SPEED]
Find Duplicates is massively slow, and heavy, a real elephant.

MessCleaner does its work pretty fast. That is, once you've hit scan.

Jv16 is fast. But note: it doesn't compare content.

[CUSTOMIZE COMPARE CRITERIA]
Find Duplicates options are on or off for: "Require same name; require
same date/time."

Messcleaner choices: Name; Size; Date; Content.

Jv16, zero choices.

I'm not precisely sure how it then defines duplicates. Possibly it decides by
yes on matches of Name + Size + Date? (I haven't investigated. I prefer
a tool that gives more information than jv16, about the files in the results
list.)

[MULTIPLE PATHS]

Best feature of Messcleaner. Multiple paths at a time, instead of just one
directory. As you get with Clonespy.

Jv16, one path at a time.
 
O

omega

Spoon2001 said:
Looking for a good dupe file finder. Here some that I've tried:

Looked in my download folder, and found DoubleKiller. Launched it just now,
and am impressed. Light, fast, lots of options. It's possible it meets all
criteria you've listed, although I haven't gone through running it in depth.

Just want to get a fast note in -- to start with this one. At least at
first glance, it appears DoubleKiller might be, of the group listed so
far, the best candidate for what you want.

http://www.bigbangenterprises.de/en/doublekiller/
 
O

omega

Spoon2001 said:
Looking for a good dupe file finder. Here some that I've tried:

Dupeless - PCMag, not freeware anymore. Can't sort the results by size,

DoubleKiller, yes. Ascending or descending.

Can manually select, or let it autoselect, what to delete. Can also do other
things from the results list- such as launch file; or save list; or copy an
entry, with its info, to the clipboard.
CloneSpy - gives me one set of duplicates at a time. I prefer a table list
of duplicate files like I get in Dupeless.

DoubleKiller, table yes. A good one, too.

Further, its interface is superior to Dupeless. Tabbed windows, of generous
size. One for the dupes result list, with action buttons for that. One for
the criteria you've set for finding duplicates. One that is the text help
file.
DupeLocater - taking forever to run

DoubleKiller is a jack rabbit across the field.
- no ability to customize how files are
compared by different criteria (name, size, date, CRC, etc)

DoubleKiller: name, size, date, CRC.

.. . .


Must be this app was custom-built for you.

http://www.bigbangenterprises.de/en/doublekiller/

My one disclaimer is that I've only run it for <20 mins now. Yet really
I can't imagine it has any serious disappointments to manifest later.
 
S

Spoon2001

Omega/Karen,

Many, many thanks for all the great info on these dupe-finder programs.

You've done us a real service!
 
S

Spoon2001

omega said:
"Son Of Spy" <"Son Of Spy"@whatever.net>:

We all know that 1.7's dupes find was not real, that none of us
expect to
live on this earth long enough to actually see if it would ever
complete.


I've seen no reports here, yet, about v2's dupes find feature....

I ran 1.7 tonight ... I didn't let it complete. It was taking forever.
Dupe-finders need to give options compare by name/date/size without
necessarily examining the entire contents of the file.
 
O

omega

Spoon2001 said:
Omega/Karen,

Many, many thanks for all the great info on these dupe-finder programs.

You've done us a real service!

Wow, thanks for saying that.

Thing is your starting post was ideal - how you'd set forth the several
major players of the category, to identify key features found, and lacking.
It made it compelling to then bring into your layout a few more players.
 
S

Spoon2001

DoubleKiller is a jack rabbit across the field.

Arggh ... don't know if it's my system, but DoubleKiller is taking a VERY
long time to scan the files on my disk. Even though I have told
Doublekiller to check filenames only in determining duplicates. I have over
300K files on my disk; Dupeless gets through them very quickly. Don't know
why Doublekiller is bogging down. Otherwise DoubleKiller looked like a
killer app.
 
S

Spoon2001

Spoon2001 said:
Looking for a good dupe file finder. Here some that I've tried:

Dupeless - PCMag, not freeware anymore. Can't sort the results by
size, unless I use size as a criterion for identifying duplicate
files. I really want to be able to sort by size so that I can wipe
out the bigger dupes first (even if the dupes aren't the same size).

CloneSpy - gives me one set of duplicates at a time. I prefer a
table list of duplicate files like I get in Dupeless.

DupeLocater - taking forever to run - no ability to customize how
files are compared by different criteria (name, size, date, CRC, etc)

I've just done a performance comparison.

Dupeless took 10 minutes to go through 385,792 files and identify
duplicates. Very good. I required matching name and size.

I gave Easy Cleaner 2.0 well over an hour to do the same thing, before I
just shut it down. The results display is a list of the groups of matching
files. It doesn't give you anything but the pathname for the files. Thumbs
down.
 
O

omega

Spoon2001 said:
I've just done a performance comparison.

Dupeless took 10 minutes to go through 385,792 files and identify
duplicates. Very good. I required matching name and size.

I gave Easy Cleaner 2.0 well over an hour to do the same thing, before I
just shut it down. The results display is a list of the groups of matching
files. It doesn't give you anything but the pathname for the files. Thumbs
down.

Good to see someone took on testing out the new Easy Cleaner's dupe files
component (helps spare others).

There was a thread recently, where its "what's new" list was shown to
include this statement:

: "Duplicate file finder re-writen, unbelievable speed!"

To which John F replied:

: > That is certainly true of past versions.

And now we have it - thanks to your test - that the new Easy Cleaner 2.0
continues to demonstrate..."unbelievable speed." :)
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Top