Question: Plagiarism Detector Introduction In this homework you will construct a system that checks the similarity rate between a specific document and a set of documents

Plagiarism Detector

Introduction In this homework you will construct a system that checks the similarity rate between a specific document and a set of documents in java.

Implementation Input files

The program will get two inputs: 1) A set of documents (.txt files) under a folder, 2) The main file (.txt file) which will be checked for plagiarism. Outputs The program will display the following outputs:

1) The similarity rate between the main document and each document under the folder given as input,

2) The most similar 5 statements / sentences for each document.

For the evaluation of your implementation, during the demo that will be held in the last week of the semester you will be asked to run your program for a folder containing multiple .txt files named as document1.Txt, document2.Txt, etc. and a main .txt file called main_doc.txt.

1.1 Details - You can use any text searching algorithm and/or data structure you would like; you can even use algorithms not discussed in class. - The pattern matching algorithm must be written by yourself. - In addition to your algorithms similarity detection ability, you will also be graded on the speed of your code, so you should try to choose an efficient algorithm.

1.2 Important Instructions

Yourprogram shouldgive an output tothe consolein thefollowing format when executed:

1) {Similarity Rate with the First Document} {Most Similar Sentence} {Second Most Similar Sentence} {Third Most Similar Sentence} {Fourth Most Similar Sentence} {Fifth Most Similar Sentence}

2) {Similarity Rate with the Second Document} {Most Similar Sentence} {Second Most Similar Sentence} {Third Most Similar Sentence} {Fourth Most Similar Sentence} {Fifth Most Similar Sentence} 3) {Similarity Rate with the Third Document} {Most Similar Sentence} {Second Most Similar Sentence} {Third Most Similar Sentence} {Fourth Most Similar Sentence} {Fifth Most Similar Sentence}

------------------------------------------------------------------------------------------------

Dear chegg teacher you can use your own txt fles for input. I just need the code. Please name your txt files as mentioned above. With that, the code will be enough for me.

I will be so glad if you write comments that describes what is it doing.

Thank you from now.

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!