Question: 4. Suggest an algorithm that checks if your 10TB disk hard drive contains two identical files. You are not allowed to use values provided by

4. Suggest an algorithm that checks if your 10TB disk hard drive contains two identical files. You are not allowed to use values provided by the File System/Operating System (such as MD5). The number n of files is about 1010. Note that 1GB is roughly 10% If there are any identical pairs of files, your algorithm should print the names of such a pair, and stop. Suggest a solution that is efficient both in terms of CPU time and in terms of the number of disk access operations (I/O). Your algorithm should be practical for your currant desktop or PC. There are a several heuristics that could be efficient in certain scenarios, but could also fail miserably. Using the size of the file will fails if most of the files are images from your camera (uncompressed). Summing up ASCII values is slightly better, but is iffy for images because it mostly depending on the background pixels. You do not have access to the creation date, and even if you do, it will not assist if you are looking for copyright violation. In short, try to do better. 4. Suggest an algorithm that checks if your 10TB disk hard drive contains two identical files. You are not allowed to use values provided by the File System/Operating System (such as MD5). The number n of files is about 1010. Note that 1GB is roughly 10% If there are any identical pairs of files, your algorithm should print the names of such a pair, and stop. Suggest a solution that is efficient both in terms of CPU time and in terms of the number of disk access operations (I/O). Your algorithm should be practical for your currant desktop or PC. There are a several heuristics that could be efficient in certain scenarios, but could also fail miserably. Using the size of the file will fails if most of the files are images from your camera (uncompressed). Summing up ASCII values is slightly better, but is iffy for images because it mostly depending on the background pixels. You do not have access to the creation date, and even if you do, it will not assist if you are looking for copyright violation. In short, try to do better
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
