Question: PLEASE SHOW ALL WORK AND EXPLAIN AS I AM TRYING TO LEARN PLEASE. NLP A group of monkeys in Nevada learned to talk. A group

PLEASE SHOW ALL WORK AND EXPLAIN AS I AM TRYING TO LEARN PLEASE. NLP

A group of monkeys in Nevada learned to talk. A group of scientists study them and develop a training corpus. Their way of talking only has 3 letters. E, R, T.

This is the corpus scientist collected: EEERETRERRRTTETRTT

1. Consider training a simple unigram language model using maximum likelihood estimation

FIND all Unigrams and state their probability in fraction.( do Not add an end symbol or start )

2. Train a bigram Language Model using maximum likelihood estimation

FIND all Bigrams and state their probability in fraction ( ADD end symbol # but do NOT add start tokens.)

3. The scientist send their boss to test the data and examine their language model. Here is the boss's data : ERETERR

A) State the perplexity of the unigram Language Model. (DONT add any tokens to the test data for the sake of consistency)

B) State the perplexity of the bigram Language Model. (DO add an end symbol # for the sake make it consistently to make it consistent)

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!