Comprehensive Flashcards on NLP, Machine Learning, and Information Retrieval Concepts

Computer Science - Software Engineering

Created by user_jevbwl 7 months ago

Cards in this deck (88)
Which of the following is NOT considered a limitation of regular-expression based text extraction?
Which task involves assigning a categorical label to every word in a piece of text?
What does Natural Language Processing (NLP) concern itself with?
What might you do to improve the performance of a poorly performing multilingual BERT-based classifier?
Which of the following functions is NOT commonly used as an activation function in neural networks?
Which of the following is NOT a common ranking function used in term-based information retrieval?
'The size of the vocabulary grows roughly in proportion to the square root of the length of the document' is a statement of whose law?
In multi-task learning, what is the aim or benefit of fine-tuning a language model to perform many different tasks at the same time?
Which of the following statements about the most frequently occurring words in a corpus is true?
In a web search engine, which signals might provide useful features to the rank learner?
What does a byte-pair encoding do?
The sentence 'Paris Hilton was photographed leaving the Paris Hilton' is an example of a sentence containing:
Latent Dirichlet Allocation (LDA) is an algorithm that is often used to:
If the probability of the sequence 'I love NLP' was exactly 1/64, what would the perplexity of the sentence be?
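One way to work through this card is sketched below. It assumes the common definition of perplexity as the inverse sequence probability normalised by the number of tokens, and it assumes the sentence counts as exactly three tokens with no end-of-sentence marker; the function name `perplexity` is illustrative only.

```python
def perplexity(sequence_probability: float, num_tokens: int) -> float:
    """Inverse probability of the sequence, normalised by its length in tokens."""
    return sequence_probability ** (-1.0 / num_tokens)

# 'I love NLP' treated as three tokens (an assumption; no end-of-sentence token).
pp = perplexity(1 / 64, 3)
print(round(pp, 6))  # 4.0
```

Under these assumptions the perplexity is the cube root of 64. Counting an end-of-sentence token would change the exponent and therefore the result.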
What are the true positive (TP), true negative (TN), false positive (FP) and false negative (FN) counts given the following confusion matrix: TP=3071, TN=2401, FP=142, FN=103?
Which of the following is NOT a property of Word2Vec word embeddings?
Which of the following measures would not be appropriate for evaluating a speech-to-text system?
In hierarchical agglomerative text clustering, what does single-linkage (minimum distance) tell you about the types of clusters that could be found?
How can you improve entity in relation to which and options?
The sentence: 'I didn't just say what I just said.' is an example of a phrase that:
When evaluating dialog produced by a chatbot, ideally we would rate performance based on:
When generating text from a language model, which technique will likely require the most computational resources and thus be slowest?
The fact that the exclamation mark '!' can denote a factorial, the question mark '?' can indicate a missing value, and the period '.' can be a decimal point, complicates which NLP task?
The task of determining who or what is being referred to by a pronoun in a sentence is called:
The process of aligning words to a common reference dictionary to ensure consistent spelling/formatting throughout the corpus is referred to as:
Which statement about the limitations of n-gram language models is NOT correct?
GloVe embeddings are used to:
The Mel Spectrogram is just a spectrogram which has:
In order to improve the probability estimates for an n-gram language model we could:
A statistical language model computes:
When generating text from a language model with top-k sampling, setting the value of k to the size of the vocabulary would be equivalent to performing:
Which, if any, of the following techniques is NOT used to produce a spectrogram for analysing audio signals?
Machine translation is an example of what type of problem?
Which of the following prompts to a language model would be considered an example of one-shot learning?
Which statement about the T5 (Text-To-Text Transfer Transformer) model is NOT true?
Assume that you have learnt Word2Vec embeddings of size 512 over a vocabulary of four hundred thousand tokens. Approximately how much memory (in GB) would you need to store all of the vectors if the usual double precision (64 bit) floating point numbers are used?
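The arithmetic behind this card can be sketched as follows. It assumes one vector per vocabulary token, 8 bytes per 64-bit float, and no storage overhead; the variable names are illustrative.

```python
vocab_size = 400_000      # tokens in the vocabulary
embedding_dim = 512       # values per vector
bytes_per_value = 8       # 64-bit double precision

total_bytes = vocab_size * embedding_dim * bytes_per_value
print(total_bytes)           # 1638400000
print(total_bytes / 10**9)   # 1.6384 (decimal gigabytes)
print(total_bytes / 2**30)   # 1.52587890625 (binary gibibytes)
```

Whether the approximate answer is read as about 1.6 GB or about 1.5 GiB depends on which definition of a gigabyte is intended.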
Which of the following statements about the Bag-of-Words (BOW) representation of a document is correct?
In NLP, the process of splitting a document up into a sequence of words is called:
Consider the regular expression: '\d{1,2}-(Jan|Feb|Mar|Apr|May|Jun|Jul|Aug|Sep|Oct|Nov|Dec)-\d{2,4}'. Which, if any, of the following strings would the expression match?
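The card's answer options are not shown, so the sketch below tests the expression against a few hypothetical strings chosen for illustration. It uses Python's `re` module and reports both a whole-string match and a substring match, since the card does not say which notion of "match" is meant.

```python
import re

pattern = re.compile(
    r"\d{1,2}-(Jan|Feb|Mar|Apr|May|Jun|Jul|Aug|Sep|Oct|Nov|Dec)-\d{2,4}"
)

# Hypothetical candidates, not the card's actual options.
candidates = ["5-Jan-2021", "05-January-21", "2021-Jan-05", "31-dec-99"]

results = {}
for text in candidates:
    found = pattern.search(text)
    results[text] = {
        "full": pattern.fullmatch(text) is not None,       # whole string matches
        "substring": found.group(0) if found else None,    # first match anywhere
    }
    print(text, results[text])
```

With these inputs, only "5-Jan-2021" matches as a whole string. "2021-Jan-05" matches only as the substring "21-Jan-05", "05-January-21" fails because a hyphen must follow the three-letter month, and "31-dec-99" fails because the month names are case-sensitive.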
Which of the following is a common text pre-processing step in an NLP application?
Explain the concept of other in relation to text and following?
In a traditional (lexical/term-based) search engine, a posting list contains:
Which statement about sequence-to-sequence models with attention is NOT true?
What is the significance of which in relation to options and language?
In Information Theory, the logarithm of one over the probability of an event, log(1/p), corresponds to:
In text-to-speech systems, certain words like 'bass' can be problematic. Why?
Which statement about a Long Short-Term Memory (LSTM) network is NOT true?
Given the following conditional probabilities for a trigram language model, what would be the probability of the sequence 'I like chocolate ice cream'?
Building spoken interface agents is much harder than building chatbots because:
Which traditional (lexical) retrieval function would be most robust to a spammer who tries to push their web page up in the search rankings by adding many occurrences of the query term?
According to a Naive Bayes model, what is the probability that a student gets a 'high' grade if she describes the exam as 'long and difficult'?
The act of attributing human emotions and intentions to a computer program is referred to as:
Which of the following statements about the uses of word embeddings is generally true?
The study of patterns of stress and intonation that affect the intended meaning of spoken language is referred to as:
Do we prefer language models with higher perplexity or lower perplexity?
In terms of speech acts, when someone repeats back to the speaker part of what they have just said, what is usually the purpose of doing this?
The fact that the expression 'I made her duck' could mean 'I caused her to lower her head to avoid being hit' or 'I cooked the fowl that she had bought' is an example of the fact that:
Traditionally, Conditional Random Fields were used in NLP to solve which of the following tasks:
Which of the following tasks would NOT be considered a typical NLP problem:
In NLP, which of the following statements regarding a parse tree is NOT correct?
What type of learning is the model doing when given the following prompt?
'I'm afraid for the calendar. Its days are numbered. => not funny
I only know 25 letters of the alphabet. I don't know y. => not funny
What do you call a fish wearing a bowtie? SoFISHticated. => funny
What do you call a factory that makes okay products? A satisfactory. => funny
I thought the dryer was shrinking my clothes. Turns out it was the refrigerator all along. => funny
I asked my dog what's two minus two. He said nothing. =>'
Considering the taxonomy of speech acts defined by Bach and Harnish, when someone advises/asks/orders/requests somebody, they are performing which type of speech act?
Which of the following Machine Learning models makes use of a bidirectional Transformer architecture to extract a feature representation of text?
Using top-k sampling with k set to 2, what would the chance of seeing the output 'a b c' be in a bigram language model with given probabilities?
Which of the following techniques is often used for learning sequence-to-sequence models in NLP?
What trick did Eliza (the chatbot) use for creating meaningful conversations in an open domain with little or no domain knowledge?
Text normalisation is needed for a text-to-speech system in order to:
Which of the following statements about the Logistic Regression (LR) classifier is true?
If the output of a text classifier produces the following confusion matrix on the test set, what is the Precision of the classifier?
                    Predicted Class
                      +       -
Actual Class   +     95      25
               -      5      75
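A short sketch of the calculation for this card follows. It assumes the matrix is read with actual classes as rows and predicted classes as columns, so that 95 is the true-positive count and 5 is the false-positive count; the variable names are illustrative.

```python
# Counts read from the matrix, assuming rows = actual class, columns = predicted class.
tp = 95   # actual +, predicted +
fn = 25   # actual +, predicted -
fp = 5    # actual -, predicted +
tn = 75   # actual -, predicted -

precision = tp / (tp + fp)   # share of predicted positives that are correct
recall = tp / (tp + fn)      # share of actual positives that are found
print(precision)             # 0.95
print(round(recall, 4))      # 0.7917
```

If the rows and columns were transposed, the roles of the 25 and the 5 would swap and precision and recall would exchange values.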
The main disadvantage of the k-Medoids algorithm with respect to the k-Means algorithm is:
Which of the following tasks would NOT usually be considered a Natural Language Processing task:
Given the piece of text 'This exam is too much' and a trigram language model, what is the chance that the model produces the word 'fun' as the next token if 'top-k' sampling is used with k set to 5?
Which of the following techniques used in NLP is the most recent and considered state-of-the-art?
How is language in relation to entity and which?
In order to speed up model training, the Transformer model REMOVED what part of the sequence-to-sequence with attention model architecture?
Which one of the following regular expressions would match the telephone number '+69 403 992 010'?
How many times would the regular expression 'f\w*ny?' match the following string: 'While I get how you feel, I don't find this exam either fun or funny.'?
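The count for this card can be checked with a small sketch using Python's `re.findall`, which returns non-overlapping matches scanned left to right. Other regex engines with the same `\w` semantics should behave the same way, though that is an assumption rather than something the card states.

```python
import re

text = "While I get how you feel, I don't find this exam either fun or funny."
matches = re.findall(r"f\w*ny?", text)
print(matches)       # ['fin', 'fun', 'funny']
print(len(matches))  # 3
```

Note that "find" contributes the partial match "fin", because `\w*` backtracks until an "n" follows, while "feel" contains no "n" and does not match.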
Entity linkage is the task of:
Consider the following normalised tf-idf vectors. What would be the order of the documents if the cosine similarity is used to rank them?
Which statement about Word2Vec is true?
Which statement about sequence-to-sequence models is true?
The vector: [0,0,0,0,1,0,5,0,0,0,23,0,0,0,0,5,0,0,0,0,0,0,0,0,0,0,1,0,2,0,0,1,0,0,0,0,15,0,0,0,0,0, ....,0,0,1,0,0,0,0,0] is most likely a:
What 2-dimensional representation of an audio signal is often used in speech detection and synthesis?
What is the difference between stemming and lemmatization?
Begin-Inside-Outside tagging is often used for:
The main reason for performing stemming before building a text classifier is to:
Which statement about GPT (GPT-2, GPT-3, etc.) models is NOT true?
News aggregators can make use of clustering techniques to:
