Question: 3. Neural sequence models (14 points) (a) (2 points) Suppose you want to build a French to English translator system using an LSTM based sequence

3. Neural sequence models (14 points) (a) (2 points) Suppose you

3. Neural sequence models (14 points) (a) (2 points) Suppose you want to build a French to English translator system using an LSTM based sequence to sequence model. What would be fed to the decoder as input at each time step t? Note there should be two components to the input. (b) (2 points) Consider the two components of the input to your decoder from part (a). How does each component affect the generalization of the model during inference? (c) (4 points) Describe two different decoding strategies for generating English translations with your decoder from part (a). For each strategy, explain when you would want to use it and what is a possible drawback of that decoding strategy. (d) (2 points) You are building a text classifier using a simple single-layer, unidirectional RNN. Your friend recommend that you used a GRU cell instead of a LSTM cell. Under what circumstance might the GRU work better than the LSTM? (e) (4 points) You notice that the performance of your classifier from part (d) is not very good. Describe two extensions that you can make to your GRU model from part (d) to improve the performance of your classifier. For each extension, explain why it might help. 3. Neural sequence models (14 points) (a) (2 points) Suppose you want to build a French to English translator system using an LSTM based sequence to sequence model. What would be fed to the decoder as input at each time step t? Note there should be two components to the input. (b) (2 points) Consider the two components of the input to your decoder from part (a). How does each component affect the generalization of the model during inference? (c) (4 points) Describe two different decoding strategies for generating English translations with your decoder from part (a). For each strategy, explain when you would want to use it and what is a possible drawback of that decoding strategy. (d) (2 points) You are building a text classifier using a simple single-layer, unidirectional RNN. Your friend recommend that you used a GRU cell instead of a LSTM cell. Under what circumstance might the GRU work better than the LSTM? (e) (4 points) You notice that the performance of your classifier from part (d) is not very good. Describe two extensions that you can make to your GRU model from part (d) to improve the performance of your classifier. For each extension, explain why it might help

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Finance Questions!

subject: Differential Equations pls read instructions do not use ai. drop all references and link Instructions ODE application. - find an article related to ODE application - provide a short...

Let A, B be sets. Define: (a) the Cartesian product (A B) (b) the set of relations R between A and B (c) the identity relation A on the set A [3 marks] Suppose S, T are relations between A and B, and...

Good communication is just as stimulating as black coffee and just as hard to sleep after. - Anne Morrow Lindbergh In May 2021, David Black, CEO of Blackbox, ended his Zoom call with a sense of...

What are the biggest ah-ha! moments from Oracy Development? 6 English-Language Oracy Development Learning Outcomes After reading this chapter, you should be able to ... . Describe the basics of...

Summary this parts17.1 and 17.2 in this lesson and give one Case study with this lesson. 548 Lext 17 Leadership, Organization, 7 and Corporate Social Responsibility LEARNING OBJECTIVES the companies...

Miller-Rabin test to check whether a number N is composite. This will involve computing a N1 mod N for some value of a. [10 marks] Carry out the steps for N = 65 and a = 1, 2, 8 and 12. on what each...

CHA P TER 9 Understanding Software: A Primer for Managers 1. INTRODUCTION L E A R N I N G O B J E C T I V E S 1. Recognize the importance of software and its implications for the rm and strategic...

Tasks The goal of the project is to complete the code for the NgramAnalyser, MarkovModel, ModelMatcher and MatcherController classes, as detailed below, and to add test code to a new JUnit test...

Question: What is translanguage? Explain why? Prerace IM f you have chosen to read The Translanguaging Classroom: Leveraging Student Bilingualism for Learning, you are probably an educator-a teacher,...

Submitted to Management Science manuscript MS-0001-1922.65 Authors are encouraged to submit new papers to INFORMS journals by means of a style file template, which includes the journal title....

Metro Bank has the following balance sheet (in millions), with the risk weights in parentheses. Assets Liabilities and Equity Cash (0%) $ 19 Deposits $ 171 Mortgage loans (50%) $ 65 Subordinate debt...

Logistics Consultants Inc. (LCI) provides various logistics analysis services to other firms, including facility location decisions. It has just completed a project for a major customer, but on the...

3.68 Call centers today play an important role in managing dayto- day business communications with customers. Its important, therefore, to monitor a comprehensive set of metrics, which can help...

Select all that apply Select all the aspects of intrapersonal communication. It must use words. It takes place inside a person's mind. It must have an external audience. It involves only one person.

=+2. Do you eat GM foods? Do you eat organic food? Do you actively shop for food based on its source?

=+1. Do you support the motives behind the EUs precautionary principle?

=+ Do you think it is a wise investment of the firm?