Question 3: Efficiently Simulating Bounded Stacks with RNNs (20 pts)

As we discussed in the lecture, hierarchical structure, characterized by long-distance and nested dependencies, lies at the core of human language. Indeed, this motivated our theoretical discussion of context-free grammars as a useful paradigm for describing language. Studying modern language models with respect to their ability to efficiently represent hierarchical structure, therefore, provides evidence that they are useful models for human language. This question directly addresses this point.

More precisely, we will study the ability of recurrent neural network language models to recognize a variant of the $D(k)$ languages. As introduced in the lecture notes, the $D(k)$ languages are in a way archetypal context-free languages. Recognizing $D(k)$ languages is conceptually simple: a system has to remember the sequence of currently non-closed opening brackets and make sure they are closed in the correct order, at which point the closed bracket pairs can be "forgotten", i.e., popped off the stack. This means that the memory necessary to recognize any string in $D(k)$ is proportional to the number of non-closed brackets at any time. We formalize it by counting how many more open brackets than closed brackets there are at each timestep in the string:

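$$d(y_{\leq t}) \overset{\text{def}}{=} \mathrm{count}(y_{\leq t}, \texttt{(}) - \mathrm{count}(y_{\leq t}, \texttt{)})$$

where $\mathrm{count}(y_{\leq t}, \texttt{(})$ refers to the number of times any opening bracket occurs in $y_{\leq t}$ and $\mathrm{count}(y_{\leq t}, \texttt{)})$ the number of times any closing bracket occurs in $y_{\leq t}$.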
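The depth function $d$ can be illustrated with a short Python sketch. It assumes a two-bracket alphabet (`(`, `)`, `[`, `]`) chosen purely for illustration; the names `OPENING`, `CLOSING`, and `depth_profile` are hypothetical and do not come from the assignment.

```python
from typing import List

# Assumed alphabet for illustration: k = 2 bracket types.
OPENING = set("([")
CLOSING = set(")]")


def depth_profile(y: str) -> List[int]:
    """Return the depth d after each prefix of y (one entry per timestep).

    The depth is the number of opening brackets seen so far minus the
    number of closing brackets seen so far, regardless of bracket type.
    """
    depths = []
    d = 0
    for symbol in y:
        if symbol in OPENING:
            d += 1
        elif symbol in CLOSING:
            d -= 1
        else:
            raise ValueError(f"unexpected symbol: {symbol!r}")
        depths.append(d)
    return depths


print(depth_profile("([])[]"))  # [1, 2, 1, 0, 1, 0]
```

Note that the depth only counts brackets; it does not check that bracket types match, so a depth profile alone is not enough to decide membership in the language.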

While context-free languages like $D(k)$ describe arbitrarily deep hierarchical structures, natural languages exhibit bounded nesting in practice, as discussed in the lecture. Furthermore, unbounded nesting, and therefore unboundedly deep stacks, make it impossible to represent context-free languages with finite precision. In this question, we investigate how to represent $D(k)$ languages which can only nest up to some bounded depth $m$. We denote such languages as $D(k, m)$.

Definition (Bounded Dyck languages). Let $k, m \in \mathbb{N}$. We define the bounded Dyck language $D(k, m)$ by combining $D(k)$ with a bound on the nesting depth:

$$D(k, m) \overset{\text{def}}{=} \{ y \in D(k) \mid d(y_{\leq t}) \leq m, \; t = 1, \dots, T \},$$

where $T$ corresponds to the length of the string.
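As a rough illustration of this definition, the following Python sketch checks membership in $D(k, m)$ with an explicit stack. The bracket alphabet `PAIRS` (here $k = 2$) and the function name `in_bounded_dyck` are assumptions made for the example, not part of the assignment.

```python
from typing import Dict, List

# Assumed alphabet for illustration: each opening bracket maps to its closer.
PAIRS: Dict[str, str] = {"(": ")", "[": "]"}


def in_bounded_dyck(y: str, m: int, pairs: Dict[str, str] = PAIRS) -> bool:
    """Return True if y is well-nested and its depth never exceeds m."""
    closers = set(pairs.values())
    stack: List[str] = []
    for symbol in y:
        if symbol in pairs:
            if len(stack) == m:
                return False  # pushing would exceed the depth bound m
            stack.append(symbol)
        elif symbol in closers:
            if not stack or pairs[stack[-1]] != symbol:
                return False  # nothing to close, or wrong bracket type
            stack.pop()
        else:
            return False  # symbol outside the alphabet
    return not stack  # every opened bracket must have been closed


print(in_bounded_dyck("([()])", 3))  # True: maximum depth is 3
print(in_bounded_dyck("([()])", 2))  # False: depth 3 exceeds m = 2
```

The stack never holds more than `m` symbols, which is the property the rest of the question relies on.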

Due to their bounded nesting depth, $D(k, m)$ languages can be recognized by stacks of bounded depth and therefore with bounded memory; this means that they are in fact finite-state. This makes them especially well-suited as a benchmark for finite-precision language models.

This question is roughly divided into two parts: in the first part, you will show that an Elman RNN is able to simulate a finite-state automaton that recognizes $D(k, m)$ for some $k$ and $m$. In the second part of the question, you are asked to show that the RNN indeed recognizes the language as well using a specific definition of acceptance.

We begin with a warm-up question.

(1) Suppose that the current bounded stack configuration in an automaton recognizing $L = D(2, 3)$ is one of $\gamma_1, \gamma_2, \gamma_3$. What are the new stack configurations $\gamma_1', \gamma_2', \gamma_3'$ after reading in each of the following symbols (each one starting from a stack $\gamma_1, \gamma_2, \gamma_3$, not one after another)?

Use $\emptyset$ to denote an empty stack and simply state that the automaton would reject a string if the processed string is not in $D(2, 3)$.

Note: You have to specify nine stack configurations altogether.
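A single stack update of the kind asked for here can be sketched in Python as follows. The starting stacks and input symbols below are hypothetical stand-ins, not the ones from the exercise; the alphabet `PAIRS`, the bound `M`, and the function `step` are likewise assumptions for illustration. A return value of `None` stands for "the automaton rejects".

```python
from typing import Optional, Tuple

# Assumed setting for illustration: k = 2 bracket types, depth bound m = 3.
PAIRS = {"(": ")", "[": "]"}
M = 3

Stack = Tuple[str, ...]  # bottom of the stack first; () is the empty stack


def step(stack: Stack, symbol: str) -> Optional[Stack]:
    """Return the stack after reading one symbol, or None on rejection."""
    if symbol in PAIRS:
        # Opening bracket: push it unless the stack is already at depth M.
        return stack + (symbol,) if len(stack) < M else None
    if stack and PAIRS[stack[-1]] == symbol:
        # Closing bracket matching the top of the stack: pop it.
        return stack[:-1]
    # Mismatched closer, closer on an empty stack, or unknown symbol.
    return None


# Hypothetical starting stacks and symbols (three of each, nine results).
for start in [(), ("(",), ("(", "[", "(")]:
    for symbol in "()]":
        print(start, repr(symbol), "->", step(start, symbol))
```

Each of the nine results is computed from its own starting stack, mirroring the "not one after another" instruction.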

We next prove that $D(k, m)$ languages are in fact finite-state by constructing an FSA recognizing $D(k, m)$.
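One possible construction, offered as a sketch rather than as the intended solution, takes every stack of at most $m$ opening brackets as a state. There are $\sum_{i=0}^{m} k^i$ such stacks, so the state set is finite. The Python below builds this automaton explicitly; `build_fsa` and `accepts` are hypothetical names, and the two-bracket alphabet is an assumption for the example.

```python
from itertools import product
from typing import Dict, List, Set, Tuple

State = Tuple[str, ...]  # a state is the current stack, bottom first


def build_fsa(pairs: Dict[str, str], m: int):
    """Build an FSA whose states are all stacks of depth at most m."""
    openers = list(pairs)
    states: List[State] = [
        tuple(s) for depth in range(m + 1) for s in product(openers, repeat=depth)
    ]
    delta: Dict[Tuple[State, str], State] = {}
    for q in states:
        if len(q) < m:
            for o in openers:
                delta[(q, o)] = q + (o,)  # push an opening bracket
        if q:
            delta[(q, pairs[q[-1]])] = q[:-1]  # pop on the matching closer
    initial: State = ()
    final: Set[State] = {()}  # accept only with an empty stack
    return states, delta, initial, final


def accepts(fsa, y: str) -> bool:
    """Run the FSA on y; a missing transition means rejection."""
    _, delta, q, final = fsa
    for symbol in y:
        if (q, symbol) not in delta:
            return False
        q = delta[(q, symbol)]
    return q in final


fsa = build_fsa({"(": ")", "[": "]"}, m=3)
print(len(fsa[0]))              # 15 states: 1 + 2 + 4 + 8
print(accepts(fsa, "([()])"))   # True
print(accepts(fsa, "(((())))")) # False: depth 4 exceeds m = 3
```

Leaving transitions undefined instead of adding an explicit rejecting sink state keeps the sketch short; a complete automaton would route those cases to a single dead state.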
