Consider the following string:
abcdetkbyegacxvabyj
This string is first tokenized into words, using spaces as the delimiters, and then bigrams and their frequencies are computed from these words. This bigram set is then used to autogenerate words.
Assume the following rules are used in the autogeneration process:
The most probable word is chosen at every step based on the frequencies in the bigram set.
If two possible choices have the same probability, choose the word that appears alphabetically first.
Stop the generation process if there are no successor words with nonzero probabilities available, or if we have entered an infinite loop.
When given a start word of b, the autogeneration process generates which of the following strings?
b d t b
b c d e t k b
b c d e g a b
b y e g a b
None of the options
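
The rules above can be checked mechanically. The following is a minimal Python sketch, not an official solution. It assumes the spaces in the string were lost when the page was extracted, so each letter is treated as one word (the answer options, which are all single letters separated by spaces, suggest this). The function names build_bigrams and generate are hypothetical, chosen only for illustration. An infinite loop is detected when a word is about to repeat, since the next choice depends only on the current word.

```python
from collections import Counter, defaultdict


def build_bigrams(words):
    """Count how often each word is immediately followed by each other word."""
    counts = defaultdict(Counter)
    for prev, nxt in zip(words, words[1:]):
        counts[prev][nxt] += 1
    return counts


def generate(counts, start):
    """Greedy generation: most frequent successor, ties broken alphabetically."""
    out = [start]
    seen = {start}
    current = start
    # Stop when the current word has no successor with a nonzero count.
    while counts.get(current):
        successors = counts[current]
        # Sort key: highest count first, then alphabetical order.
        nxt = min(successors, key=lambda w: (-successors[w], w))
        out.append(nxt)
        # The choice is deterministic, so a repeated word means an infinite loop.
        if nxt in seen:
            break
        seen.add(nxt)
        current = nxt
    return out


# ASSUMPTION: spaces were stripped from the source, so every letter is a word.
words = list("abcdetkbyegacxvabyj")
bigrams = build_bigrams(words)
print(" ".join(generate(bigrams, "b")))  # prints "b y e g a b"
```

Under that assumption, b is followed by y twice and by c once, so y is chosen. The ties after y (e or j) and after e (g or t) go to the alphabetically first word, and a is followed by b twice, which returns to the start word and ends the generation. If the string is instead taken literally as a single word with no spaces, no bigrams exist and nothing follows the start word.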
