ddupes
Differences
These are the differences between the selected revision and the current version of the page.
ddupes [2012/03/13 08:29] – other tools similar to fdupes pietro | ddupes [2017/01/20 19:10] (current version) – [What is this?] true edit pietro | ||
---|---|---|---|
Line 5: | Line 5: | ||
===== What is this? ===== | ===== What is this? ===== | ||
+ | |||
**ddupes** is a python program which extends fdupes action to directories. | **ddupes** is a python program which extends fdupes action to directories. | ||
Line 30: | Line 31: | ||
necessarily read //all// files it must compare: instead, it first tries to | necessarily read //all// files it must compare: instead, it first tries to | ||
compare the heads, and reads the rest only if they match. | compare the heads, and reads the rest only if they match. | ||
+ | |||
+ | A larger test (thanks, Florian Bruhin!), run on 2.5 TB of data in | ||
+ | ~727 000 files, gave the following results: | ||
+ | * fdupes: | ||
+ | * ffdupes: 4 Hours 19 Minutes | ||
+ | * ddupes: | ||
That said, in the worst case in which there are many files which are almost | That said, in the worst case in which there are many files which are almost | ||
Line 37: | Line 44: | ||
If ffdupes is used with the " | If ffdupes is used with the " | ||
- | run statistically | + | run slower |
run faster than fdupes in //all// cases). | run faster than fdupes in //all// cases). | ||
Line 70: | Line 77: | ||
of members) groups of duplicates, which reside in directories which are very | of members) groups of duplicates, which reside in directories which are very | ||
similar but not identical. This should be a quite remote eventuality, | similar but not identical. This should be a quite remote eventuality, | ||
- | do find some patologic | + | do find some pathological |
===== Who should I blame if this sucks? ===== | ===== Who should I blame if this sucks? ===== | ||
Line 76: | Line 83: | ||
Pietro Battiston - < | Pietro Battiston - < | ||
- | Last version of ddupes can always be found at | + | Last version of ddupes can always be found at http:// |
- | http:// | + | The source repo can be obtained with |
+ | git clone git:// | ||
+ | and browsed at http:// | ||
===== Requirements ===== | ===== Requirements ===== | ||
ddupes and ffdupes are written in Python, so you need python to run them. | ddupes and ffdupes are written in Python, so you need python to run them. |
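
The "compare the heads first" idea mentioned in the text above can be illustrated with a minimal Python sketch. This is not the actual ffdupes or ddupes code: the function name `same_content`, the 4096-byte head length and the chunk size are assumptions chosen for illustration only.

```python
import os

HEAD_SIZE = 4096      # assumed head length, not taken from ffdupes
CHUNK_SIZE = 1 << 16  # assumed chunk size for reading the rest


def same_content(path_a, path_b, head_size=HEAD_SIZE):
    """Return True if the two files have identical content.

    Hypothetical helper: it rejects cheaply on size, then on the first
    `head_size` bytes, and reads the rest only if the heads match.
    """
    # Files of different size can never be duplicates.
    if os.path.getsize(path_a) != os.path.getsize(path_b):
        return False
    with open(path_a, "rb") as fa, open(path_b, "rb") as fb:
        # Compare the heads; most non-duplicates are rejected here.
        if fa.read(head_size) != fb.read(head_size):
            return False
        # Heads match: compare the remainder chunk by chunk.
        while True:
            chunk_a = fa.read(CHUNK_SIZE)
            chunk_b = fb.read(CHUNK_SIZE)
            if chunk_a != chunk_b:
                return False
            if not chunk_a:  # both files exhausted without a mismatch
                return True
```

With this approach, files that differ early cost only one small read each, while files that are almost identical still have to be read nearly in full, which matches the worst case the text describes.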
ddupes.1331623781.txt.gz · Last modified: 2012/03/13 08:29 by pietro