Question: The following code performs matrix multiplication in c + + . float dot _ product _ f ( float * m 1 , float *

The following code performs matrix multiplication in c

+ + .

float dot

_

product

_

(

float

*

1,

float

*

2,

int r

,

int c

,

int l

)

{

float product

= 0

;

for

(

int i

= 0

; i

<

l; i

+ +)

{

float a

=

1 [

+

*

]

;

float b

=

2 [

+

*

]

;

product

+ =

*

/ /

printf

(" % 1

* % 1

+ ",

,

)

;

}

/ /

printf

("

")

;

return product;

}

The following is the assembly for the above c

+ +

code. The requirement is to remove redundant instructions to improve performance. Please provide new working assembly file with improved code. Thanks.

.

text

.

file

"

_

float.cpp

"

.

globl

_

13

dot

_

product

_

fPfS

_

iii #

- -

Begin function

_

13

dot

_

product

_

fPfS

_

iii

.

2

align

4, 0

90

.

type

_

13

dot

_

product

_

fPfS

_

iii,@function

_

13

dot

_

product

_

fPfS

_

iii: # @

_

13

dot

_

product

_

fPfS

_

iii

.

cfi

_

startproc

%

. 0

pushq

%

rbp

.

cfi

_

def

_

cfa

_

offset

16

.

cfi

_

offset

%

rbp

, - 16

movq

%

rsp

, %

rbp

.

cfi

_

def

_

cfa

_

%

rbp

movq

%

rdi,

- 8 (%

rbp

)

movq

%

rsi,

- 16 (%

rbp

)

movl

%

edx,

- 20 (%

rbp

)

movl

%

ecx,

- 24 (%

rbp

)

movl

%

8

, - 28 (%

rbp

)

xorps

%

xmm

0, %

xmm

0

movss

%

xmm

0, - 32 (%

rbp

)

movl $

0, - 36 (%

rbp

)

.

LBB

0_1

: #

= >

This Inner Loop Header: Depth

= 1

movl

- 36 (%

rbp

), %

eax

cmpl

- 28 (%

rbp

), %

eax

jge

.

LBB

0_4

%

. 2

: # in Loop: Header

=

0_1

Depth

= 1

movq

- 8 (%

rbp

), %

rax

movl

- 36 (%

rbp

), %

ecx

movl

- 20 (%

rbp

), %

edx

imull

- 28 (%

rbp

), %

edx

addl

%

edx,

%

ecx

movslq

%

ecx,

%

rcx

movss

(%

rax,

%

rcx

, 4), %

xmm

0

# xmm

0 =

mem

[0],

zero,zero,zero

movss

%

xmm

0, - 40 (%

rbp

)

movq

- 16 (%

rbp

), %

rax

movl

- 24 (%

rbp

), %

ecx

movl

- 36 (%

rbp

), %

edx

imull

- 28 (%

rbp

), %

edx

addl

%

edx,

%

ecx

movslq

%

ecx,

%

rcx

movss

(%

rax,

%

rcx

, 4), %

xmm

0

# xmm

0 =

mem

[0],

zero,zero,zero

movss

%

xmm

0, - 44 (%

rbp

)

movss

- 40 (%

rbp

), %

xmm

0

# xmm

0 =

mem

[0],

zero,zero,zero

movss

- 44 (%

rbp

), %

xmm

2

# xmm

2 =

mem

[0],

zero,zero,zero

movss

- 32 (%

rbp

), %

xmm

1

# xmm

1 =

mem

[0],

zero,zero,zero

mulss

%

xmm

2, %

xmm

0

addss

%

xmm

1, %

xmm

0

movss

%

xmm

0, - 32 (%

rbp

)

%

. 3

: # in Loop: Header

=

0_1

Depth

= 1

movl

- 36 (%

rbp

), %

eax

addl $

1, %

eax

movl

%

eax,

- 36 (%

rbp

)

jmp

.

LBB

0_1

.

LBB

0_4

movss

- 32 (%

rbp

), %

xmm

0

# xmm

0 =

mem

[0],

zero,zero,zero

popq

%

rbp

.

cfi

_

def

_

cfa

%

rsp

, 8

retq

.

Lfunc

_

end

0

.

size

_

13

dot

_

product

_

fPfS

_

iii,

.

Lfunc

_

end

0 -_

13

dot

_

product

_

fPfS

_

iii

.

cfi

_

endproc

- -

End function

.

ident "clang version

17.0.6 (

CentOS

17.0.6 - 5 .

9) "

.

section

" .

note.GNU

-

stack","",@progbits

.

addrsig

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Programming Questions!

The following code performs matrix multiplication in c + + . It takes integer pointers int dot _ product _ i ( int * m 1 , int * m 2 , int r , int c , int l ) { int product = 0 ; for ( int i = 0 ; i...

Introduction: Write a C program that performs matrix multiplication. For this program, a matriz is a rectangular array of integer entries, like this 1 2 3 -4 L9 10 11 12 More generally, define...

Introduction: Write a C program that performs matrix multiplication. For this program, a matriz is a rectangular array of integer entries, like this 1 3 -4 5 6 7 8 9 10 11 12 More generally, define...

This code is done is MATLAB and I need to debugg it: The Matrix to be used for testing are the following: Program for matrix multiplication Challenge #01...

On January 1, Year 5, Pic Company acquired 7,500 ordinary shares of Sic Company for $699,000. On January 1, Year 6, Pic Company acquired an additional 2,000 ordinary shares of Sic Company for...

Lab 1 Direction: Submit the typed source code. Class Statistics For this lab, you will be evaluating the class from assignment 1. To recap, a students record consists of the following assignments:...

What will be the output of the following code snippets and briefly explain the logic: (C PROGRAM PLEASE) 5. int main(void) { int a = 3, b = 4, c; c = b a; switch (c) { case 1 || 2: printf("God give...

3 - Provide the final values after executing the following code snippit 20pts #include float fun2(float zoo, float soo)X int fun1 (int* a, int b); float fun2(float a, float b); return *soo; int...

C language not C++ 1. Write the statements to do the following: (2 pts) a. Define a struct with member variables width, height, topleft x, topleft y all floats). Use a tag to call it Rectangle. b....

A DC servomotor is used to actuate one of the axes of an x-y positioned. The motor has a torque constant of 8.75 in-lb/A and a voltage constant of 10 V/(1000 rev/min). The armature resistance is 2.0...

5. Sodium benzoate is the conjugate base of benzoic acid and it is used as a food preservative. How would you make a buffer solution based on this system to have a pH = 5.0? 6. What part of the...

The terms face value, par value, maturity value, and terminal value all have the same meaning in the bond markets

Required Information [The following information applies to the questions displayed below.] A company makes the payment of a one-year insurance premium of $4,464 on March 1, 2019. -1. Use the...