Given the two files DNA SEQ, and PROT SEQ compress them using ( i ) Binary coding, ( ii ) Huffman coding For binary coding if you have n unique letters, then you can use the binary representation for each letter For example, if the text has 8 unique letters, then you can code the letters using three ( log 2 ) bits as , 0 0 0 , 0 0 1 , 0 1 0 , 0 1 1 , 1 0 0 , 1 0 1 , 1 1 0 , 1 1 1 For Huffman coding use the method discussed in the slides As usual, for either method you can use or adapt codes available online ( i ) Compare the memory required to store ( a ) the DNA SEQ using Binary and Huffman coding ( b ) PROT SEQ using Binary and Huffman coding ( 1 0 ) ( ii ) Discuss why the DNA SEQ did not show as significant savings as the PROT SEQ ( 1 0 ) ( iii ) Develop an algorithm by which you can modify how you store the DNA SEQ so that you obtain better saving than the binary method This has to be a lossless compression, similar to Huffman coding You only have to describe the algorithm in detail ( no code needed ) and explain why it will improve the storage ( 2 0 )

The Answer is in the image, click to view ...

Question: Given the two files DNA _ SEQ, and PROT _ SEQ compress them using ( i ) Binary coding, ( ii ) Huffman coding. For

Given the two files DNA

_

SEQ, and PROT

_

SEQ compress them using

(

)

Binary coding,

(

)

Huffman coding.

For binary coding if you have n unique letters, then you can use the binary representation for

each letter. For example, if the text has

8

unique letters, then you can code the letters using three

(

log

2)

bits as

, 000, 001, 010, 011, 100, 101, 110, 111 .

For Huffman coding use the method discussed

in the slides. As usual, for either method you can use or adapt codes available online.

(

)

Compare the memory required to store

(

)

the DNA

_

SEQ using Binary and

Huffman coding

(

)

PROT

_

SEQ using Binary and Huffman coding.

(10)

(

)

Discuss why the DNA

_

SEQ did not show as significant savings as the PROT

_

SEQ

(10)

(

iii

)

Develop an algorithm by which you can modify how you store the DNA

_

SEQ so that you

obtain better saving than the binary method. This has to be a lossless compression, similar to

Huffman coding. You only have to describe the algorithm in

detail

(

no code needed

)

and explain why it will improve the storage.

(20)

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Programming Questions!

Describe and justify an algorithm for finding the shortest distance between each pair of vertices in an undirected graph in which each edge has a given positive length. If there is no path between a...

Write a program that implements the Huffman coding compression algorithm using priority queues and binary trees. Huffman coding is an algorithm devised by David A. Huffman of MIT in 1952 for...

Problem Write a program that implements the "Huffman coding" compression algorithm using priority queues and binary trees. Huffman coding is an algorithm devised by David A. Huffman of MIT in 1952...

Digital Communication I X A B C D Y Hosts X and Y are communicating through the data network provided by the switches A, B, C and D and the links interconnecting them as shown above. Initially all...

f a processor exhibited one branch delay slot how would you reorder (and possibly modify) the instructions in the following loop to gain a performance advantage? loop ldr r2,r3,#4 % r2=load(r3),...

A bitstring is just a sequence of bits, e.g. 10010110. Normally, printable characters in a file are encoded as bytes, or bitstrings of length 8. This is called ASCII code. For example, the ASCII code...

A bitstring bs just a sequence of bits, e.g. 10010110. Normally, pirintable characters in a file are encoded as bytes, or bitstrings of length 8. This is called ASCII oode. For example, the ASCII...

Lab #6 HUFFMAN CODING Huffman coding is an algorithm devised by David A. Huffman of MIT in 1952 for compressing text data to make a file smaller (fewer bytes). This relatively simple algorithm is...

Design and implement the Huffman Tree using Queue and PriorityQueue from the previous assignments. The Huffman Tree will be further used to encode input strings. Your task is to develop a simple,...

A network based service manages persistent objects. The service must enforce an access control policy to protect the objects. (a) Discuss how this access control might best be implemented for the...

Hops originate from the flowers of Humulus lupulus and are used primarily as a flavoring and stability agent in beer. Hops have several characteristics that are very favorable to beer: Hops...

We wish to use an NMOS transistor as a variable resistor with Ron = 500capital omega at VGS = 1 V and Ron = 400 capital omega at VGS = 1.5 V. Is it possible to design such NMOS transistor and why?...

Considering that M&A activity often involves paying a premium for an acquired firm, how can a company ensure that the synergies from the merger or acquisition will create value beyond the premium...

Finland has an elevated rate of homicide due to easy access to handguns in their country. Group of answer choices True False