Finding and removing duplicate files

In my last post, I explained how I wanted to confirm that I had properly restored all my files from a hard drive crash before freeing up space on my external drive by deleting some of the old backups. I had started that process by using a directory comparison utility, but discovered that looking at each directory one at a time would take too long. When I went back to the file management section of my favorite download site, I discovered a couple of utilities with the word “duplicate” in the title. Bingo. I found a plethora of options using the following search terms:

remove duplicate files
find duplicates
delete duplicates

I needed a utility that would let me match files based on name and size, since the modification dates had been changed after I restored my files. Several utilities also offer to match using a CRC (checksum). I don’t feel that I’ve found the perfect utility yet, but I found two very good ones that I’ll cover here.

Duplicate Cleaner 1.2

The interface for this tool is a little clunky, but the functionality was great. Here is a shot of the main screen:

Duplicate Cleaner 1.2

After pointing the program to the places you want to search for duplicates, you press the Start button and the program switches to a screen full of the results. Here is a shot of the results screen with the Selection Assistant:

Selecting Dups

The selection assistant is one of the best features of this utility. With the selection assistant, for example, I was able to select only duplicates that came from my external drive where I keep backup files. The one and only problem I had with this utility was that after using it to compare very large sets of files and selecting the duplicates found on my backup drive, the system gave me a warning message (before deleting, thankfully) saying that I had selected all the files in 11 groups of duplicates. One the one hand, this was a very helpful warning because the whole point of my effort was to confirm that I wasn’t about to delete unique files on my backup drive. On the other hand, the utility didn’t tell me which 11 of the 5,000+ sets where the problem. I had to break my effort into smaller sets, and I was still grateful for the warning.

Double Killer 1.6.2

This is the second utility that I liked. You can see from the shot of this setup screen that you can choose search paths and select matching criteria:

  Double Killer 1.6.2

This utility allows you to select the first duplicates in a set or the last duplicates in a set, thus, avoiding the problem of selecting every instance of a given copy. Your other selection options are pretty limited, however, so I think this utility is a better choice for someone who has files that are pretty organized to begin with.

Advertisement

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Connecting to %s


Follow

Get every new post delivered to your Inbox.