Question: In this question, you will learn to implement the Abstractive Summarization task using the encoder - decoder model. 1 . Load the dataset cnn -

In this question, you will learn to implement the Abstractive Summarization task using the encoder

-

decoder model.

1 .

Load the dataset cnn

-

dailymail

\ ({}^{1} \) .

Use version

3.0.0 .

Use train set for training and test set for testing purposes. Use columns article which is the news article published in CNN and Daily Mail, and highlights which is the summary of the given news article.

2 .

Preprocess the article column and the highlights column with NLTK

.

The preprocessing steps include converting the text to lower case, removing special characters and punctuations. Add and tokens at the start and the end of each row of the highlights text.

3 .

Tokenize the text in both columns and build a separate vocabulary for each column. Create the word

-

-

index and index

-

-

word dictionaries for article column and convert the text in article column to the index vector.

4 .

You should use an encoder

-

decoder architecture to generate the text. The encoder should be a RNN

(

.

.,

LSTM

,

GRU, etc.

)

and the decoder should be a fully connected

(

dense

)

layer. Note that the initial hidden states for the decoder will be the output hidden states of the encoder model. Also, the input sequence to the decoder will have the same length that of the target sequence starting with

.

In the inference stage, the process will run till token is encountered.

5 .

Train the model with cross

-

entropy loss. On the test set, generate the summary using beam search. Your model should be able to generate text of at least

10

words.

6 .

Use the rouge module in PyTorch

\ ({}^{2} \),

calculate and report the average ROUGE

- 1,

ROUGE

- 2,

and ROUGE

-

L scores for the test set.

In this question, you will learn to implement the

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Programming Questions!

Q 2 . Abstractive Summarization using T 5 : Code [ 5 ] In this question, you will implement the summarization task using the pre - trained model T 5 . 1 . Load the dataset cnn - dailymail with 3 . 0...

Q 3 . Neural Machine Translation: Code [ 2 0 ] In this question, you will learn to implement the Neural Machine Translation task using a sequence - to - sequence model with attention. 1 . Load WMT 1...

(REALLY NEED HELP CREATING THIS CODE IN FULL AND ITS COMPLETE ENTIRETY... ALL OF THE DETAILS ARE PROVIDED AND THE CODE SHOULD HAVE EACH PART FOR EACH QUESTION LABELED SEPARATELY... PLEASE HELP ME AND...

In this question, you will learn to build a Naive Bayes Classifier for the binary classification task. 1 . Dataset: "Financial Phrasebank" dataset from HuggingFace . 1 To load the data, you need to...

Image captioning using Deep Learning General Instructions: You are recommended to use Google Colab or Jupyter notebook. No need to upload data. If the pre-processed data, in the form of Python pickle...

1 Assignment 2 Latent Variables and Neural Networks Due Date: 21:59:59 23 May 2021 Please note that, 1. 1 sec delay will be penalized as 1 day delay. So please submit your assignment in advance...

INSTRUCTIONS ---> Python There are three parts to this project in Python. Please read all sections of the instructions carefully. I. Perceptron Learning Algorithm II. Linear Regression III....

INSTRUCTIONS There are three parts to this project in Python. Please read all sections of the instructions carefully. I. Perceptron Learning Algorithm II. Linear Regression III. Classification You...

(a) In SystemVerilog, what is the difference between: (i) The ternary operator ? and if...then...else statements? [2 marks] (ii) always_ff and always_comb? [2 marks] (iii) Blocking, non-blocking and...

Write a part of code (loop) that displays the following series. 1 1 10 20 30 40 90 100 Your answer

Given the following relative frequencies of the letters of the alphabet, design a prefix-free code that minimizes the encoding length. Show the detail steps of the Huffman- coding algorithm. Letter...

Which of the following is true? Multiple Choice Members of an audit engagement team cannot speak with audit client officers about matters outside the scope of the audit while the audit engagement is...

Consider the project network shown below. Think about fast-tracking in the absence of resource constraints. Suppose activities 5-8 and 8-12 can be done in parallel. At what level of parallelization is