
This is about RNN - Rebuild Model. I used the model below in Google Colab.

 

LOADING THE IMDB DATASET:

from tensorflow.keras.datasets import imdb

(train_data, train_labels), (test_data, test_labels) = imdb.load_data(
    num_words=10000)

train_data[0]
train_labels[0]
max([max(sequence) for sequence in train_data])

 

Decoding reviews back to text

word_index = imdb.get_word_index()
reverse_word_index = dict(
    [(value, key) for (key, value) in word_index.items()])
decoded_review = " ".join(
    [reverse_word_index.get(i - 3, "?") for i in train_data[0]])
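For reference, the indices are shifted by 3 because imdb.load_data reserves 0, 1, and 2 for padding, start-of-sequence, and unknown tokens. A quick sanity check (my own addition, reusing the variables above):

# 0 = padding, 1 = start of sequence, 2 = unknown word, so real word indices start at 3.
print(decoded_review[:200])        # first part of the decoded review
print("label:", train_labels[0])   # 1 = positive, 0 = negative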

 

PREPARING THE DATA:
Encoding the integer sequences via multi-hot encoding

import numpy as np

def vectorize_sequences(sequences, dimension=10000):
    results = np.zeros((len(sequences), dimension))
    for i, sequence in enumerate(sequences):
        for j in sequence:
            results[i, j] = 1.
    return results

x_train = vectorize_sequences(train_data)
x_test = vectorize_sequences(test_data)

x_train[0]

y_train = np.asarray(train_labels).astype("float32")
y_test = np.asarray(test_labels).astype("float32")
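A minimal alternative sketch (the name vectorize_sequences_fast is my own, not from the listing): NumPy fancy indexing can fill each row in one call instead of looping over every word index.

def vectorize_sequences_fast(sequences, dimension=10000):
    # Same multi-hot result as vectorize_sequences above, but one indexing
    # assignment per review instead of an inner Python loop.
    results = np.zeros((len(sequences), dimension), dtype="float32")
    for i, sequence in enumerate(sequences):
        results[i, sequence] = 1.
    return results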


BUILDING THE MODEL:

Model Definition

from tensorflow import keras
from tensorflow.keras import layers

model = keras.Sequential([
    layers.Dense(16, activation="relu"),
    layers.Dense(16, activation="relu"),
    layers.Dense(1, activation="sigmoid")
])
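An optional tweak (my own addition, not in the original listing): declaring the input shape up front builds the model immediately, so model.summary() can be inspected before training.

model = keras.Sequential([
    keras.Input(shape=(10000,)),       # each review is a 10,000-dim multi-hot vector
    layers.Dense(16, activation="relu"),
    layers.Dense(16, activation="relu"),
    layers.Dense(1, activation="sigmoid")
])
model.summary()                        # prints layer output shapes and parameter counts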

 

Compiling the model

model.compile(optimizer="rmsprop",
              loss="binary_crossentropy",
              metrics=["accuracy"])

 

VALIDATING THE MODEL:

 

Setting aside a validation set

x_val = x_train[:10000]
partial_x_train = x_train[10000:]
y_val = y_train[:10000]
partial_y_train = y_train[10000:]
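A quick shape check (my own addition) to confirm the split: 10,000 reviews go to validation and the remaining 15,000 of the 25,000 training reviews stay for training.

print(x_val.shape, partial_x_train.shape)   # expect (10000, 10000) and (15000, 10000)
print(y_val.shape, partial_y_train.shape)   # expect (10000,) and (15000,)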

 

Training your model

history = model.fit(partial_x_train,
                    partial_y_train,
                    epochs=20,
                    batch_size=512,
                    validation_data=(x_val, y_val))

history_dict = history.history
history_dict.keys()
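For reference, with accuracy as the only metric and validation_data supplied, history.history should contain four lists, one value per epoch; a quick check (my own addition):

# Expected keys: 'loss', 'accuracy', 'val_loss', 'val_accuracy'
for key, values in history.history.items():
    print(key, len(values))   # each list has 20 entries, one per epoch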

 

Plotting the training and validation loss

import matplotlib.pyplot as plt

history_dict = history.history
loss_values = history_dict["loss"]
val_loss_values = history_dict["val_loss"]
epochs = range(1, len(loss_values) + 1)
plt.plot(epochs, loss_values, "bo", label="Training loss")
plt.plot(epochs, val_loss_values, "b", label="Validation loss")
plt.title("Training and validation loss")
plt.xlabel("Epochs")
plt.ylabel("Loss")
plt.legend()
plt.show()
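An optional line (my own addition) to read the best epoch off the history instead of eyeballing the plot:

best_epoch = val_loss_values.index(min(val_loss_values)) + 1   # epoch with the lowest validation loss
print("Lowest validation loss at epoch", best_epoch)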

 

Plotting the training and validation accuracy

plt.clf()
acc = history_dict["accuracy"]
val_acc = history_dict["val_accuracy"]
plt.plot(epochs, acc, "bo", label="Training acc")
plt.plot(epochs, val_acc, "b", label="Validation acc")
plt.title("Training and validation accuracy")
plt.xlabel("Epochs")
plt.ylabel("Accuracy")
plt.legend()
plt.show()

 

Retraining the model from scratch

model = keras.Sequential([
    layers.Dense(16, activation="relu"),
    layers.Dense(16, activation="relu"),
    layers.Dense(1, activation="sigmoid")
])
model.compile(optimizer="rmsprop",
              loss="binary_crossentropy",
              metrics=["accuracy"])
model.fit(x_train, y_train, epochs=4, batch_size=512)
results = model.evaluate(x_test, y_test)

results
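evaluate() returns the loss followed by the compiled metrics, so results unpacks as [loss, accuracy]; a small sketch (my own addition) to print it readably:

test_loss, test_acc = results
print(f"Test loss: {test_loss:.4f}  Test accuracy: {test_acc:.4f}")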

 

USING A TRAINED MODEL TO GENERATE PREDICTIONS ON NEW DATA

model.predict(x_test)
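The sigmoid output is one probability per review, so a cutoff (0.5 here, my own illustrative choice) turns the predictions into class labels:

probs = model.predict(x_test)                          # shape (num_reviews, 1), values in [0, 1]
predicted_labels = (probs > 0.5).astype("int32").ravel()
print((predicted_labels == test_labels).mean())        # should roughly match the evaluate() accuracy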

 

-------------------------

 

After running the model above, I modified it using the code below:

 

LOADING THE IMDB DATASET

# Increasing the number of words
from tensorflow.keras.datasets import imdb

(train_data, train_labels), (test_data, test_labels) = imdb.load_data(
    num_words=11000)

 

# Decoding reviews back to text
word_index = imdb.get_word_index()
reverse_word_index = dict(
    [(value, key) for (key, value) in word_index.items()])
decoded_review = " ".join(
    [reverse_word_index.get(i - 3, "?") for i in train_data[0]])

decoded_review

 

DATA PREPARATION

Encoding the integer sequences via multi-hot encoding

import numpy as np

def vectorize_sequences(sequences, dimension=11000):
    results = np.zeros((len(sequences), dimension))
    for i, sequence in enumerate(sequences):
        for j in sequence:
            results[i, j] = 1.
    return results

x_train = vectorize_sequences(train_data)
x_test = vectorize_sequences(test_data)

y_train = np.asarray(train_labels).astype("float32")
y_test = np.asarray(test_labels).astype("float32")

 

BUILDING THE MODEL

# Changing the activation function from relu to tanh, increasing the number of hidden Dense layers to 3, and increasing their units from 16 to 64

from tensorflow import keras
from tensorflow.keras import layers

model = keras.Sequential([
    layers.Dense(64, activation="tanh"),
    layers.Dense(64, activation="tanh"),
    layers.Dense(64, activation="tanh"),
    layers.Dense(1, activation="sigmoid")
])

 

# Changed loss function to mean_squared_error
model.compile(optimizer="rmsprop",
              loss="mean_squared_error",
              metrics=["accuracy"])

 

VALIDATING THE MODEL

x_val = x_train[:11000]
partial_x_train = x_train[11000:]
y_val = y_train[:11000]
partial_y_train = y_train[11000:]
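The same shape check as before (my own addition): slicing at 11,000 leaves 14,000 of the 25,000 training reviews for training.

print(x_val.shape, partial_x_train.shape)   # expect (11000, 11000) and (14000, 11000)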

 

# Keeping number of epochs at 10
history = model.fit(partial_x_train,
                    partial_y_train,
                    epochs=10,
                    batch_size=512,
                    validation_data=(x_val, y_val))

history_dict = history.history
history_dict.keys()

 

Plotting the training and validation loss

import matplotlib.pyplot as plt

history_dict = history.history
loss_values = history_dict["loss"]
val_loss_values = history_dict["val_loss"]
epochs = range(1, len(loss_values) + 1)
plt.plot(epochs, loss_values, "bo", label="Training loss")
plt.plot(epochs, val_loss_values, "b", label="Validation loss")
plt.title("Training and validation loss")
plt.xlabel("Epochs")
plt.ylabel("Loss")
plt.legend()
plt.show()

 

plt.clf()
acc = history_dict["accuracy"]
val_acc = history_dict["val_accuracy"]
plt.plot(epochs, acc, "bo", label="Training acc")
plt.plot(epochs, val_acc, "b", label="Validation acc")
plt.title("Training and validation accuracy")
plt.xlabel("Epochs")
plt.ylabel("Accuracy")
plt.legend()
plt.show()

 

RETRAINING THE MODEL

# Reducing the number of epochs to 2, where val_loss is at its minimum and val_acc is at its second-highest value

model = keras.Sequential([
    layers.Dense(64, activation="tanh"),
    layers.Dense(64, activation="tanh"),
    layers.Dense(64, activation="tanh"),
    layers.Dense(1, activation="sigmoid")
])
model.compile(optimizer="rmsprop",
              loss="mean_squared_error",
              metrics=["accuracy"])
model.fit(x_train, y_train, epochs=2, batch_size=512)
results = model.evaluate(x_test, y_test)

results

 

PREDICTING USING THE MODEL

model.predict(x_test)
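A small illustrative sketch (my own addition) to eyeball a few predictions next to the true labels:

preds = model.predict(x_test)
for i in range(5):
    # Predicted probability of a positive review vs. the true label (0 = negative, 1 = positive).
    print(f"review {i}: p(positive) = {preds[i, 0]:.3f}, true label = {test_labels[i]}")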

 

------------------------------

 

QUESTION:

 

1. Is there any problem with my code, and can you suggest anything we could add to make it better? Please provide code and an explanation.

 

Then answer the following questions:

  • What modification(s) did you make?
  • How did this impact accuracy?
