Question: We are familiar with binary Huffman codes. Input symbols are converted into codewords comprising a sequence of 0's and 1's, or bits. In a ternary Huffman code, codewords comprise a sequence of trits, each of which can take one of three values: 0, 1, or 2. The Huffman coding algorithm for ternary codewords is the same as for binary codewords, except that three symbols are combined into a new metasymbol at each step. (To make the combining work out, sometimes an additional 0-frequency placeholder symbol is used, but in the examples below that won't be necessary.)

1. Given the following input alphabet and corresponding frequencies (expressed as probabilities):

   Symbol   Frequency
   A        0.50
   B        0.26
   C        0.12
   D        0.06
   E        0.06

   (a) Build the corresponding binary Huffman code.
   (b) Compute the expected bits per symbol for this binary Huffman code. Remember that this is weighted by the probability of occurrence of each symbol.
   (c) Build the corresponding ternary Huffman code.
   (d) Compute the expected trits per symbol for this ternary Huffman code.
   (e) Convert the expected trits to expected bits per symbol by multiplying your previous answer by 1.585.

2. Repeat the same 5 steps from part 1) with the following input alphabet and corresponding frequencies (expressed as probabilities):

   Symbol   Frequency
   A        0.35
   B        0.35
   C        0.10
   D        0.10
   E        0.10

3. In information theory, the term entropy refers to the expected amount of information contained in one random symbol from a given probability distribution. The more "unknown" the value is, the higher the entropy. For example, flipping a fair coin has an entropy of 1 bit of information. If the coin is weighted so that it almost always comes up heads, then flipping that coin has an entropy of less than 1 bit, because there is less uncertainty about the outcome. The entropy eventually drops all the way to zero if the outcome is known (heads always comes up). Entropy also represents a bound on the efficiency possible when encoding symbols from a probability distribution. Huffman coding yields an optimal prefix code, but because it requires an integer number of bits (or trits) per codeword, it doesn't necessarily reach the entropy limit for efficiency. For a given probability distribution, entropy in expected bits per symbol is computed as:

   H(X) = -Σ_x p(x) · log2(p(x))

   In this equation, X is the random variable, H(X) is the entropy of the random variable, x represents a symbol (outcome) from the distribution, and p(x) represents the probability of occurrence of that symbol. The sum runs over all of the possible symbols in the distribution.

   (a) Compute the entropy (in expected bits per symbol) for the probability distribution from part 1). All you need to do this is the probabilities from the table, plugged into the equation.
   (b) Compute the entropy (in expected bits per symbol) for the probability distribution from part 2).

4. Draw conclusions. In which example did the binary Huffman code achieve an efficiency closer to entropy? In which example did the ternary Huffman code achieve an efficiency closer to entropy? What kind of probabilities does a binary Huffman code seem best suited to encode efficiently? What kind of probabilities does a ternary Huffman code seem best suited to encode efficiently?
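As a sanity check on parts 1 and 2, here is a minimal Python sketch of the greedy Huffman construction, generalized to an n-symbol code alphabet so the same routine handles both the binary (arity 2) and ternary (arity 3) cases. The symbol names A-E are assumptions (the question only fixes the frequencies), and the heapq-based merge below is one standard way to implement the algorithm described above, not necessarily the tree an expert solution would draw; ties between equal frequencies can produce different but equally optimal codes.

```python
import heapq

def huffman_code_lengths(freqs, arity=2):
    """Greedy Huffman construction over an `arity`-symbol code alphabet.

    Returns a dict mapping each source symbol to its codeword length
    (bits for arity=2, trits for arity=3).  Assumes (arity - 1) divides
    (len(freqs) - 1), so no 0-frequency placeholders are needed; that
    holds for both five-symbol tables in the question.
    """
    # Heap entries: (subtree probability, tie-breaker, [(symbol, depth), ...]).
    heap = [(p, i, [(sym, 0)]) for i, (sym, p) in enumerate(freqs.items())]
    heapq.heapify(heap)
    tie = len(heap)
    while len(heap) > 1:
        # Merge the `arity` least-probable subtrees into one metasymbol;
        # every leaf underneath gets one code symbol deeper.
        merged_p, merged_leaves = 0.0, []
        for _ in range(arity):
            p, _tb, leaves = heapq.heappop(heap)
            merged_p += p
            merged_leaves += [(sym, depth + 1) for sym, depth in leaves]
        tie += 1
        heapq.heappush(heap, (merged_p, tie, merged_leaves))
    return dict(heap[0][2])

def expected_length(freqs, lengths):
    """Expected code symbols per source symbol, weighted by probability."""
    return sum(freqs[s] * lengths[s] for s in freqs)

# Frequency tables from parts 1 and 2 (symbol names A-E are assumed).
dist1 = {"A": 0.50, "B": 0.26, "C": 0.12, "D": 0.06, "E": 0.06}
dist2 = {"A": 0.35, "B": 0.35, "C": 0.10, "D": 0.10, "E": 0.10}

for name, dist in (("part 1", dist1), ("part 2", dist2)):
    bits = expected_length(dist, huffman_code_lengths(dist, arity=2))
    trits = expected_length(dist, huffman_code_lengths(dist, arity=3))
    print(f"{name}: {bits:.2f} bits/symbol, {trits:.2f} trits/symbol "
          f"(~{trits * 1.585:.3f} equivalent bits/symbol)")
```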
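For part 3, a similar sketch of the entropy formula H(X) = -Σ_x p(x) · log2(p(x)), applied to the same assumed A-E tables, along with the log2(3) ≈ 1.585 factor used in step (e) to convert trits to bits:

```python
import math

def entropy_bits(freqs):
    """H(X) = -sum over x of p(x) * log2(p(x)), in bits per symbol."""
    return -sum(p * math.log2(p) for p in freqs.values() if p > 0)

# Same assumed A-E tables as in the Huffman sketch above.
dist1 = {"A": 0.50, "B": 0.26, "C": 0.12, "D": 0.06, "E": 0.06}
dist2 = {"A": 0.35, "B": 0.35, "C": 0.10, "D": 0.10, "E": 0.10}

print(f"part 1 entropy: {entropy_bits(dist1):.3f} bits/symbol")
print(f"part 2 entropy: {entropy_bits(dist2):.3f} bits/symbol")

# Step (e)'s conversion factor: one trit carries log2(3) bits of information.
print(f"log2(3) = {math.log2(3):.3f}")   # ~1.585
```

Comparing these entropies against the expected code lengths from the previous sketch is exactly the comparison part 4 asks for.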
