Question: This is in python #A common problem in academic settings is plagiarism #detection. Fortunately, software can make this pretty easy! # #In this problem, you'll

This is in python

#A common problem in academic settings is plagiarism

#detection. Fortunately, software can make this pretty easy!

#In this problem, you'll be given two files with text in

#them. Write a function called check

_

plagiarism with two

#parameters, each representing a filename. The function

#should find if there are any instances of

5

or more

#consecutive words appearing in both files. If there are,

#return the longest such string of words

(

in terms of number

#of words, not length of the string

) .

If there are not,

#return the boolean False.

#For simplicity, the files will be lower

-

case text and spaces

#only: there will be no punctuation, upper

-

case text, or

#line breaks.

#We've given you three files to experiment with. file

_1 .

txt

#and file

_2 .

txt share a series of

5

words: we would expect

#check

_

plagiarism

("

file

_1 .

txt

",

"file

_2 .

txt

")

to return the

#string

"

if i go crazy then". file

_1 .

txt and file

_3 .

txt

#share two series of

5

words, and one series of

11

words:

#we would expect check

_

plagiarism

("

file

_1 .

txt

",

"file

_3 .

txt

")

#to return the string "i left my body lying somewhere in the

#sands of time". file

_2 .

txt and file

_3 .

txt do not share any

#text, so we would expect check

_

plagiarism

("

file

_2 .

txt

",

#"file

_3 .

txt

")

to return the boolean False.

#Be careful: there are a lot of ways to do this problem, but

#some would be massively time

-

or memory

-

intensive. If you

#get a MemoryError, it means that your solution requires

#storing too much in memory for the code to ever run to

#completion. If you get a message that says "KILLED", it

#means your solution takes too long to run.

#Add your code here!

def check

_

plagiarism

(

file

1_

name, file

2_

name

)

with open

(

file

1_

name,

'

')

as file

1,

open

(

file

2_

name,

'

')

as file

2

text

1 =

file

1 .

read

() .

split

()

text

2 =

file

2 .

read

() .

split

()

longest

_

match

= []

current

_

match

= []

for word

1

in text

1

if word

1

in text

2

current

_

match.append

(

word

1)

else:

if len

(

current

_

match

) >

len

(

longest

_

match

)

longest

_

match

=

current

_

match

current

_

match

= []

if len

(

current

_

match

) >

len

(

longest

_

match

)

longest

_

match

=

current

_

match

if len

(

longest

_

match

) > = 5

return

'' .

join

(

longest

_

match

)

else:

return False

#Below are some lines of code that will test your function.

#You can change the value of the variable

(

)

to test your

#function with different inputs.

#If your function works correctly, this will originally

#print:

#if i go crazy then

#i left my body lying somewhere in the sands of time

#False

(

check

_

plagiarism

("

file

_1 .

txt

",

"file

_2 .

txt

"))

(

check

_

plagiarism

("

file

_1 .

txt

",

"file

_3 .

txt

"))

(

check

_

plagiarism

("

file

_2 .

txt

",

"file

_3 .

txt

"))

I receive this output:

if i go crazy then

i left my body lying somewhere in the sands of time i watched the world float to the

False

the part where it says "I watched the world float to the

"

is not supposed to be there, how can I fix this?

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Finance Questions!

A common problem in academic settings is plagiarism detection. Fortunately, software can make this pretty easy! In this problem, you'll be given two files with text in them. Write a function called...

E-1. Analyze the five interviews. In a paragraph, discuss what type of structure each interview has. E-2. List each interview, 1 through 5, and then write a paragraph for each, discussing ways that...

This is a individual assignment, which is due on 27th of jan, wed, 1pm. Can u help me to do it? Financial Accounting 2 ACG 27, Study Period 4, 2015 Case Study for Annual Report Assignment The...

Working on the City of Smithville Short version. I need the journal entries for Chapters 5, 6, and 9 (including Budgetary) ASAP. Thank you for your help. Instructions City of Smithville Short Version...

I would like to have the entries for chapter 5 of the Smithville computer case (short version) see attachment Instructions City of Smithville Short Version Computerized Cumulative Problem For use...

Can I have with chap 6 and 9 journal entries of Smithville Project Project attached Instructions City of Smithville Short Version Computerized Cumulative Problem For use with McGraw-Hill/Irwin...

After reading the passage below. Please assist with how might these styles impact the choice of theoretical approach with a client. What are Attributional and Explanatory Styles? Over time the...

Match the letters on the right with the numbers on the left to complete the mathematical statement about PDF properties. Assume that x is a normally distributed annual occurrence. 1. a. Standard...

Listed below are several transactions that took place during the first two years of operations for the law firm of Pete, Pete, and Roy. Year 1 Year 2 Amounts billed to clients for services rendered...

Your company has a project available with the following cash flows: \ table [ [ , ] , [ 0 , - $ 8 0 , 1 0 0

how to calculate the least squares regression line? Usung the following formula: Y'=by+a The least-squares regression line Y' = byX + ay Because we are predicting Y given X, we call it the regression...