Finds duplicate filesystem extents and optionally schedule them for deduplication. An extent is small part of a file inside the filesystem. On some filesystems one extent can be referenced multiple times, when parts of the content of the files are identical. More information: https://markfasheh.github.io/duperemove/.
duperemove -r path/to/directory
duperemove -r -d path/to/directory
duperemove -r -d --hashfile=path/to/hashfile path/to/directory
duperemove -r -d --hashfile=path/to/hashfile --io-threads=N --cpu-threads=N path/to/directory