Question: 3 - Deep learning accelerator Design and implement a verilog module called perceptron . We will use fixed point format in our simple single layer

3 -

Deep learning accelerator Design and implement a verilog module called

perceptron

.

We will use fixed point format in our simple single layer perceptron. It should follow the following specification. Top level Port name Direction Width Description rst

_

n Input

0

0

Active low reset clk Input

0

0

Clock x

1

Input

7

0

Signed fixed point x

2

Input

7

0

Signed fixed point valid

_

in Input

0

0

Inputs valid y Output

0

0

Unsigned

(0

1)

_

valid Output

0

0

Y valid, driven by design Perceptron should also contain internally

(

Read Only Memory

-

ROM

) -

b reg

7

0

Bias

-

Signed fixed point w

1

reg

7

0

Weight

-

Signed fixed point w

2

reg

7

0

Weight

-

Signed fixed point Please use the following values signed

[7

0]

1 = 8'

00000010

; signed

[7

0]

2 = 8'

11111110

; signed

[7

0]

= 8'

11111101

; Fixed point x

1,

2,

,

1,

2

all have an implicit decimal point in the middle of the

4

least significant bits i

.

.

/

w bit

2

and bit

1 .

Hence, every signed fixed point number

= (

Value if it were a signed integer

) / 4, 000001012 = 1.25 (= 5 / 4) 111110002 = - 2 (= - 8 / 4) 111110102 = - 1.5 (= - 6 / 4)

etc You may use the same adder and multiplier that work with signed integers, but will need to account for the decimal point. If we add two numbers of the above type, the decimal point is still

2

places from the least significant bit

(

LSB

) .

But if we multiply two such numbers, the implicit decimal point is

4

places from the least significant bit, so to get the correct fixed point product, we need to shift it right by

2

in this case. Working

-

The perceptron must be pipelined

(

each stage

1

clock cycle and computation on new inputs begins every clock cycle

) .

In first stage, it uses

2

signed multipliers to calculate the products p

1 =

1 *

1

and p

2 =

2 *

2 .

In the second stage, it adds the p

1,

2

and b to get an intermediate signed output

-

.

For a deep learning accelerator we will over provision, hence, we will use multiple multipliers and adders such that all multiplications may be finished in

1

stage

(

clock cycle

)

and all additions may be finished in one stage

(

clock cycle

) .

For now, we just want to get it functionally correct and pipelined. At a later point we may want to refine the design to be our accelerator. You may use

*

and

+

sign now, but at some point we may want to use the IP components and may want to pipeline the multiplier for best performance. You may save your self future effort by using the library components now itself

(

be aware that will change clock cycles, no extra marks

) .

After the addition

(

after the

2

nd stage

),

it asserts or de

-

asserts y based on

-

= 1

if s

> = 0 0

Otherwise Hence, latency

= 2

Clock cycles You must use internal registers of appropriate size to contain the complete intermediate results.

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Programming Questions!

Q 3 - Deep learning accelerator Design and implement a verilog module called "perceptron". We will use fixed point format in our simple single layer perceptron. It should follow the following...

A creative engineer suggests structuring the TLB so that not all the bits of the presented address need match to result in a hit. Suggest how this might be achieved, and what might be the costs and...

can someone solve this Modern workstations typically have memory systems that incorporate two or three levels of caching. Explain why they are designed like this. [4 marks] In order to investigate...

Good communication is just as stimulating as black coffee and just as hard to sleep after. - Anne Morrow Lindbergh In May 2021, David Black, CEO of Blackbox, ended his Zoom call with a sense of...

Portray in words what transforms you would have to make to your execution to some degree (a) to accomplish this and remark on the benefits and detriments of this thought.You are approached to compose...

Python and most Python libraries are free to download or use, though many users use Python through a paid service. Paid services help IT organizations manage the risks associated with the use of...

Computer Organization and Networks Practicals 2021/22 October 9, 2021 Computer Organization and Networks Practicals 2021/22 b68495714b Contents Contents 0 Introduction 3 0.1 Registration . . . . . ....

There are two problems due this week (each worth 35 points) as follows. Problem 1.6 (page 20) In comprehensive paragraphs, answerrequirements a to e. You will have 5 paragraphs total of four to five...

digital system design with fgpa i need solution ASAP Objective: The objective of this lab is to introduce the student to parameterized modules in Altera and to use these modules to implement a simple...

London School of Science & Technology Qualification Unit number and title BTEC Level 5 HND Diploma Business UNIT 6: Business Decision Making Student name and ID number Assessor name Al Hassan Barrie...

______ demand refers to the fact that total demand for a product is not affected much by price changes. Consumer Inelastic Elastic Business

The half-life of iodine-125 is about 60 days. If we were to start with 64 g of it, about how much would remain after 1 year?

Profit margin measures the relation of debt to assets. Group startsTrue or False True, unselectedFalse

Karen promises her daughter, Amy, that if she quits smoking for 6 months, she will pay Amy $500 . Amy agrees to the arrangement. After Amy fulfills her promise and refrains from smoking for six months