Question:

treasure1.txt

The Old Sea-dog at the Admiral Benbow

SQUIRE TRELAWNEY, Dr Livesey, and the rest of these gentlemen having asked me to write down the whole particulars about Treasure Island, from the beginning to the end, keeping nothing back but the bearings of the island, and that only because there is still treasure not yet lifted, I take up my pen in the year of grace 17__ and go back to the time when my father kept the Admiral Benbow inn and the brown old seaman with the sabre cut first took up his lodging under our roof.

frankenstein1.txt

I am by birth a Genevese, and my family is one of the most distinguished of that republic. My ancestors had been for many years counsellors and syndics, and my father had filled several public situations with honour and reputation. He was respected by all who knew him for his integrity and indefatigable attention to public business. He passed his younger days perpetually occupied by the affairs of his country; a variety of circumstances had prevented his marrying early, nor was it until the decline of life that he became a husband and the father of a family.

Write a function token(fileNameA, fileNameB, x) that, given two strings fileNameA and fileNameB that contain pathnames to two text files (encoded in utf8), and a floating point number 0 < x < 1, returns the list of all the words having a frequency larger than or equal to x in at least one of the two files. As usual, a word is a maximal sequence of alphabetical characters. When reading the files, all the words have to be made lower case (e.g., the string "Alice" and the string "ALICE" should both be transformed into "alice").

For instance, suppose that the "en.txt" file contains:

Alice is about to say "Sherlock, it's me, Alice. Sherlock... Sherlock!"

Suppose further that the "it.txt" file contains:

Alice dice "Ciao Sherlock, sono Alice, Sherlock... Sherlock!".

Then "en.txt" contains 12 words, and "it.txt" contains 8 words. The frequency of "sherlock" in "en.txt" is 0.25 (3/12). The frequency of "alice" in "en.txt" is, instead, approximately 0.166666666667 (2/12). The frequency of "sherlock" in "it.txt" is 0.375 (3/8). And the frequency of "alice" in "it.txt" is 0.25 (2/8). Every other word, in each of the files, has frequency smaller than 1/7.0.

Therefore, token("en.txt", "it.txt", 0.3) must return the list ["sherlock"]. Conversely, token("en.txt", "it.txt", 0.24) can return the list ["alice", "sherlock"] or the list ["sherlock", "alice"]. (No specific ordering of the words is required.)

Be aware that, to run the grader correctly, you should run it in a directory that contains all the content of the zip file, and your program03.py. Please remember not to change the name of the function, and not to use non-ascii characters.
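The word counts in the example can be checked with a short sketch. The helper name words_of is hypothetical, and the sketch assumes that "alphabetical characters" means the ASCII letters a-z after lowercasing; a grader that counts accented letters as alphabetical would need a different pattern.

```python
import re
from collections import Counter

def words_of(text):
    # Assumption: a word is a maximal run of ASCII letters, taken after lowercasing.
    return re.findall(r"[a-z]+", text.lower())

# Content of the example "en.txt" file from the problem statement.
en = 'Alice is about to say "Sherlock, it\'s me, Alice. Sherlock... Sherlock!"'

words = words_of(en)
counts = Counter(words)

print(len(words))                       # 12
print(counts["sherlock"] / len(words))  # 0.25
```

Note that "it's" splits into the two words "it" and "s", which is how the example arrives at 12 words.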

Step by Step Solution


def wordcount(str):
    for char in ...:
        str = str.replace(char, ...)
    str = str.lower()
    counts = dict()
    words = str.split()
    totalcount = 0
    for w...
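The answer preview above is cut off, so what follows is a separate minimal sketch of one possible approach, not the expert's solution. The helper word_frequencies is a hypothetical name, and the sketch assumes that "alphabetical characters" means the ASCII letters a-z after lowercasing.

```python
import re
from collections import Counter

def word_frequencies(path):
    # Read the whole file as utf8 and lowercase it before extracting words.
    with open(path, encoding="utf8") as f:
        text = f.read().lower()
    # Assumption: a word is a maximal run of ASCII letters.
    words = re.findall(r"[a-z]+", text)
    total = len(words)
    if total == 0:
        return {}  # an empty file has no words and therefore no frequencies
    return {word: n / total for word, n in Counter(words).items()}

def token(fileNameA, fileNameB, x):
    # Collect every word whose frequency reaches x in at least one file.
    frequent = set()
    for path in (fileNameA, fileNameB):
        for word, freq in word_frequencies(path).items():
            if freq >= x:
                frequent.add(word)
    return list(frequent)
```

With the two example files from the problem statement saved as en.txt and it.txt, token("en.txt", "it.txt", 0.3) should return ["sherlock"], and token("en.txt", "it.txt", 0.24) should return "alice" and "sherlock" in some order. A set is used so that a word frequent in both files appears only once in the result.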
