Finddupe is a VERY FAST command line C program to catalog very large archives, identifying duplicate files even when offline. It has many features. You can easily grep a catalog to find what you have, and locate where it is.
this only makes hard links to the duplicates. It offers _no_ way to delete the files, or move them to another directory.