Question: Help me to get the maximum points on these questions! These questions are not from my exam today, but rather documentation that I obtained in
Help me to get the maximum points on these questions! These questions are not from my exam today, but rather documentation that I obtained in More detailed question is informed on the picture
Questions:
Explain intuitively:
a What is the tokenization? points
b What is the different between stemming and lemmatization? Give an example!
points
Let we have corpus: low, newer, wider, new.
a Do subword tokenization using wordpiece tokenization in three iterations. points
b What is the tokenization of word "lower" according to wordpiece? points
We are given the following corpus, modified from the one in the chapter:
I am Sam
Sam I am
Sam like eggs
I do not like green eggs and Sam
a Create the bigram counting table and probability table! points
b If we use linear interpolation smoothing between a maximumlikelihood bigram
model and a maximumlikelihood unigram model with and what
is Sam Include and in your counts just like any other token.
points
Given the following short movie reviews, each labeled with a genre, either comedy or action:
fun, couple, love, love : comedy
fast, furious, shoot : action
couple, fly, fast, fun, fun : comedy
furious, shoot, shoot, fun : action
fly, fast, shoot, love : action
A new document D: fast, couple, shoot, fly
a Compute the most likely class for Assume a naive Bayes classifier and use add
smoothing for the likelihoods. points
b Compute the vector representation of document D using tfidf. points
Explain what the attention mechanism is give a brief example and visualization! points
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
