Question: Deep Learning Architectures: ( a ) Given a task to develop a language model for French speaking, Peter plans to train a bidirectional LSTM model
Deep Learning Architectures:
a Given a task to develop a language model for French speaking, Peter plans to train a bidirectional LSTM model for it based on the corpus. Before he starts, what suggestions do you have for him about the design? points
b In attention mechanism, why are the word embeddings from input not directly used for key, query, and values? point
c after pretraining a model eg BERT sometimes we only use the encoder part and sometimes we only use the decoder part. Please explain briefly in what scenarios we use encoder and in what scenarios we use decoder. point
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
