Question:

Implement a language identification system in the programming language of your choice. You should start by selecting several languages (four or five should do). You should have a suitable quantity of typical material in each language; about 1000 words in each language would be plenty. First, write an algorithm that determines the 100 most common trigrams in each language. Now build these data into a program that uses them to determine the language of unseen text. Produce an alternative version of the software that calculates a frequency vector using all (26 * 26 * 26) trigrams. How does this system perform compared with the first one you produced in terms of accuracy and efficiency?

Step by Step Answer:
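
The two variants described in the question can be sketched in Python roughly as follows. This is a minimal illustration under stated assumptions, not a reference solution: the names `trigrams`, `top_trigrams`, `identify_top`, `trigram_index`, `frequency_vector`, `cosine`, and `identify_vector` are hypothetical, the `SAMPLES` texts are tiny placeholders standing in for the roughly 1000 words per language the question asks for, text is reduced to the letters a-z (so accented letters split words), trigrams are taken within words, and the scoring rules (hit counting for the top-100 version, cosine similarity for the full-vector version) are one plausible choice among several.

```python
import math
import re
from collections import Counter

# Placeholder training text (assumption): replace with ~1000 words per language.
SAMPLES = {
    "english": "the quick brown fox jumps over the lazy dog "
               "and then the other thing with that",
    "german": "der schnelle braune fuchs springt über den faulen hund "
              "und dann das andere ding",
}


def trigrams(text):
    """Yield letter trigrams taken within runs of a-z (lowercased)."""
    for word in re.findall(r"[a-z]+", text.lower()):
        for i in range(len(word) - 2):
            yield word[i:i + 3]


# Version 1: keep only the n most common trigrams per language.
def top_trigrams(text, n=100):
    """Return the set of the n most frequent trigrams in text."""
    counts = Counter(trigrams(text))
    return {t for t, _ in counts.most_common(n)}


def identify_top(text, profiles):
    """Pick the language whose top-trigram set matches the most trigrams."""
    scores = {
        lang: sum(1 for t in trigrams(text) if t in top)
        for lang, top in profiles.items()
    }
    return max(scores, key=scores.get)


# Version 2: a dense frequency vector over all 26 * 26 * 26 trigrams.
def trigram_index(t):
    """Map a three-letter a-z string to a position in 0 .. 26**3 - 1."""
    a, b, c = (ord(ch) - ord("a") for ch in t)
    return (a * 26 + b) * 26 + c


def frequency_vector(text):
    """Return relative trigram frequencies as a list of length 26**3."""
    vec = [0.0] * (26 ** 3)
    for t in trigrams(text):
        vec[trigram_index(t)] += 1.0
    total = sum(vec)
    return [v / total for v in vec] if total else vec


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def identify_vector(text, vectors):
    """Pick the language whose frequency vector is closest to the text's."""
    query = frequency_vector(text)
    return max(vectors, key=lambda lang: cosine(query, vectors[lang]))


if __name__ == "__main__":
    profiles = {lang: top_trigrams(s) for lang, s in SAMPLES.items()}
    vectors = {lang: frequency_vector(s) for lang, s in SAMPLES.items()}
    for unseen in ("the other dog", "der hund und das ding"):
        print(unseen, identify_top(unseen, profiles),
              identify_vector(unseen, vectors))
```

On efficiency, the first version stores at most 100 trigrams per language and does one set lookup per trigram of the unseen text, while the second stores and compares 17,576 numbers per language for every query. On accuracy, the sketch makes no claim: that comparison has to be measured on held-out text in each language, with far more training material than the placeholder samples used here.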
