Question:

4. Suggest an algorithm that checks whether your 10TB hard drive contains two identical files. You are not allowed to use values provided by the File System/Operating System (such as MD5). The number n of files is about 10^10. Note that 1GB is roughly 10^9 bytes. If there are any identical pairs of files, your algorithm should print the names of one such pair and stop.

Suggest a solution that is efficient both in terms of CPU time and in terms of the number of disk access operations (I/O). Your algorithm should be practical for your current desktop or PC.

There are several heuristics that could be efficient in certain scenarios, but could also fail miserably. Using the size of the file will fail if most of the files are uncompressed images from your camera. Summing up ASCII values is slightly better, but is iffy for images because it mostly depends on the background pixels. You do not have access to the creation date, and even if you did, it would not help if you are looking for copyright violations. In short, try to do better.
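
One possible direction, offered only as a minimal sketch and not as the expert answer that the page hides: filter candidates in stages so that most files are never read in full. The sketch groups files by size, then by a fingerprint of the first block, then by a fingerprint of the whole file, and finally confirms a match byte by byte. The fingerprint is computed by the program itself (the file's bytes read as one base-256 number, modulo a prime), so no value supplied by the operating system is used. The names `P`, `BLOCK`, `fingerprint`, `split` and `find_duplicate_pair`, the choice of modulus, and the block size are all assumptions made for illustration.

```python
import filecmp
import os
from collections import defaultdict
from itertools import combinations

P = (1 << 61) - 1   # Mersenne prime used as the fingerprint modulus (an assumption)
BLOCK = 1 << 16     # bytes requested per read (an assumption)


def fingerprint(path, limit=None):
    """Return the file's bytes, read as one base-256 number, modulo P.

    If limit is given, only the first `limit` bytes are fingerprinted.
    """
    h = 0
    remaining = limit
    with open(path, "rb") as f:
        while remaining is None or remaining > 0:
            want = BLOCK if remaining is None else min(BLOCK, remaining)
            chunk = f.read(want)
            if not chunk:
                break
            # Shift the running value left by len(chunk) bytes, then append the chunk.
            h = (h * pow(256, len(chunk), P) + int.from_bytes(chunk, "big")) % P
            if remaining is not None:
                remaining -= len(chunk)
    return h


def split(paths, key):
    """Group paths by key(path) and keep only groups with at least two members."""
    groups = defaultdict(list)
    for p in paths:
        groups[key(p)].append(p)
    return [g for g in groups.values() if len(g) > 1]


def find_duplicate_pair(root):
    """Return one pair of identical files under root, or None if there is none."""
    paths = [os.path.join(d, name) for d, _, names in os.walk(root) for name in names]
    for same_size in split(paths, os.path.getsize):
        for same_head in split(same_size, lambda p: fingerprint(p, BLOCK)):
            for same_hash in split(same_head, fingerprint):
                # Fingerprints can collide, so confirm each candidate pair byte by byte.
                for a, b in combinations(same_hash, 2):
                    if filecmp.cmp(a, b, shallow=False):
                        return a, b
    return None
```

Calling `find_duplicate_pair("/some/root")` (a placeholder path) returns the names of one identical pair, or `None`. The sketch keeps every group in memory, which would not be realistic at the scale described in the question; a practical variant would probably write (size, fingerprint, name) records to disk and sort them externally so that equal keys end up next to each other.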
