Question: I need help implementing an assignment using C++. Assignment: Purpose To explore a meaningful use of complex binary trees To implement a real algorithm still in

I need help implementing an assignment using C++.

Assignment:

Purpose

- To explore a meaningful use of complex binary trees
- To implement a real algorithm still in use today

Background and Instructions

Lossless data compression remains a key interest in computer science. The goal is to represent data using fewer bits than the human-readable data normally requires, as compressed data is more easily stored and transmitted. Decompression can later restore the data to its original state.

It is almost a guarantee that you have used the ZIP archive file format at some point in your life; it is natively supported in Microsoft Windows, Mac OS X, and most free operating systems. It's quite possible you have used it earlier today. Although the file format accepts many compression algorithms, the most common is DEFLATE. This algorithm uses the LZ77 algorithm and Huffman coding to compress data. Huffman coding creates a prefix code for each character based on the frequency of the character, with the most used characters receiving the shortest codes.

For this assignment, you will implement Huffman coding. You can learn more about the process here, but be warned that the algorithm has some trivial ambiguities that are resolved below.

Huffman Coding Algorithm

The algorithm begins by accepting an input string and counting the number of times each character is used. Each character-count pair is stored in a leaf node. Add these leaf nodes to a priority queue, giving lower counts higher priorities and removing the highest priority node first.

- If two leaf nodes have equal count, the alphabetically smaller character has lower priority.
- If a leaf node and a compound node (see below) have equal count, the leaf node has lower priority.
- If two compound nodes have equal count, the alphabetically smaller compound node has lower priority.

Remove the first two nodes from the priority queue, combine them into a compound node, and add the new node back into the priority queue. Repeat this process until only one node is left. A compound node has the following properties:

- Its right child is the first node removed from the priority queue
- Its left child is the second node removed from the priority queue
- Its count is equal to the sum of the counts of its two children
- Its "character" is a concatenation of its left child's character and its right child's character (compound nodes store a string representing the characters its descendent leaves store)

The single remaining compound node in the priority queue is the root of the Huffman tree and is ready for encoding and decoding messages.

To encode a message, encode each character individually and concatenate the results together. For each character, follow the path from the root of the Huffman tree to the leaf node storing that character; record each "left turn" as a 0 and each "right turn" as a 1. The character's encoding is the resulting bitstream. If the message you are asked to encode contains a character not found in your tree, return an empty string. If asked to encode the empty string, return an empty string.

To decode a message, begin at the root of the Huffman tree. For each 0 or 1 in the encoded message, move down the tree left or right, respectively. When a leaf node is reached, record the character it is storing and return to the root of the tree. Continue this process until you reach the end of the encoded message; the last 0 or 1 in the encoded message should take you to the leaf node storing the last character in the decoded message. If the message you are asked to decode ends at a compound node instead of a leaf node, return an empty string. If asked to decode the empty string or any message containing characters other than 0 and 1, return an empty string.
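The tie-breaking and combining rules above can be sketched as follows. This is only an illustration under stated assumptions: the names `Node`, `outranks`, `popBest`, `buildTree`, `collectCodes`, and `codesFor` are hypothetical, `std::unique_ptr` is used for ownership instead of manual `delete`, and the "queue" is a plain vector searched linearly so the ordering rules stay visible. The assignment itself requires a hand-built heap, so this is not a submission-ready queue.

```cpp
#include <cassert>
#include <cstddef>
#include <map>
#include <memory>
#include <string>
#include <utility>
#include <vector>

// Hypothetical node type: a leaf holds one character; a compound node holds
// the concatenation of its children's labels.
struct Node {
    std::string label;
    int count = 0;
    std::unique_ptr<Node> left, right;
    bool isLeaf() const { return !left && !right; }
};

// True when a should be removed from the queue before b.
bool outranks(const Node& a, const Node& b) {
    if (a.count != b.count) return a.count < b.count;   // lower count goes first
    if (a.isLeaf() != b.isLeaf()) return !a.isLeaf();   // compound beats leaf on ties
    return a.label > b.label;                           // smaller label has lower priority
}

// Removes and returns the highest-priority node (linear scan, NOT a heap).
std::unique_ptr<Node> popBest(std::vector<std::unique_ptr<Node>>& pool) {
    assert(!pool.empty());
    std::size_t best = 0;
    for (std::size_t i = 1; i < pool.size(); ++i)
        if (outranks(*pool[i], *pool[best])) best = i;
    std::unique_ptr<Node> out = std::move(pool[best]);
    pool.erase(pool.begin() + static_cast<std::ptrdiff_t>(best));
    return out;
}

// Builds the tree; assumes the sample has at least two distinct characters.
std::unique_ptr<Node> buildTree(const std::string& sample) {
    std::map<char, int> freq;
    for (char c : sample) ++freq[c];
    std::vector<std::unique_ptr<Node>> pool;
    for (const auto& entry : freq) {
        auto leaf = std::make_unique<Node>();
        leaf->label = std::string(1, entry.first);
        leaf->count = entry.second;
        pool.push_back(std::move(leaf));
    }
    while (pool.size() > 1) {
        auto parent = std::make_unique<Node>();
        parent->right = popBest(pool);  // first node removed
        parent->left = popBest(pool);   // second node removed
        parent->count = parent->left->count + parent->right->count;
        parent->label = parent->left->label + parent->right->label;
        pool.push_back(std::move(parent));
    }
    return popBest(pool);
}

// Records a 0 for each left turn and a 1 for each right turn.
void collectCodes(const Node& node, const std::string& path,
                  std::map<char, std::string>& codes) {
    if (node.isLeaf()) { codes[node.label[0]] = path; return; }
    collectCodes(*node.left, path + "0", codes);
    collectCodes(*node.right, path + "1", codes);
}

// Builds the tree, extracts the code table, and lets the tree be freed.
std::map<char, std::string> codesFor(const std::string& sample) {
    std::map<char, std::string> codes;
    collectCodes(*buildTree(sample), "", codes);
    return codes;
}
```

Under these rules, `codesFor("Secret")` gives S = 000, c = 001, e = 1, r = 010, t = 011. Because the tree is owned by `std::unique_ptr`, it is released as soon as the code table has been filled.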

Requirement Notes

As stated above, there are a number of trivial ambiguities in Huffman coding, but in order to facilitate automated testing, you'll need to adhere to the algorithm outlined above. You must make your own priority queue using a maximum heap. This can be tree-based or array/vector-based. "Alphabetically smaller" for this assignment means using the < operator on strings.

As an example, the input string "Secret" produces the following encodings:

- S = 000
- c = 001
- e = 1
- r = 010
- t = 011

IMPORTANT NOTES FROM PROFESSOR:

- Use an array.
- Must build own priority queue. STL queue not allowed.
- You can use a map class to burn the tree (whenever you find a leaf node, delete it).
- SPACE complexity is what we are worried about. The program will not be graded on run time efficiency.
- Manage memory correctly. Do not have memory leaks!
- Capital letters have highest priority of all.
- No main is needed; only use one for your own testing purposes.
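One way to meet the "own priority queue using a maximum heap" requirement is an array-backed binary heap. The sketch below is hypothetical: `Entry`, `outranks`, and `MaxHeap` are illustrative names, the entry stores a plain leaf flag rather than a tree node pointer, and the comparison follows the written tie-breaking rules (lower count first, compound before leaf, larger string before smaller).

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <utility>
#include <vector>

// Hypothetical queue entry; a full solution would likely hold a node pointer.
struct Entry {
    std::string label;
    int count;
    bool leaf;
};

// True when a has higher priority than b (a is removed first).
bool outranks(const Entry& a, const Entry& b) {
    if (a.count != b.count) return a.count < b.count;  // lower count goes first
    if (a.leaf != b.leaf) return !a.leaf;              // compound beats leaf on ties
    return a.label > b.label;                          // smaller label has lower priority
}

// Array-backed max heap: the highest-priority entry sits at index 0.
class MaxHeap {
public:
    bool empty() const { return data_.empty(); }
    std::size_t size() const { return data_.size(); }

    void push(Entry e) {
        data_.push_back(std::move(e));
        std::size_t i = data_.size() - 1;
        while (i > 0) {                                // sift the new entry up
            std::size_t parent = (i - 1) / 2;
            if (!outranks(data_[i], data_[parent])) break;
            std::swap(data_[i], data_[parent]);
            i = parent;
        }
    }

    Entry pop() {
        assert(!data_.empty());
        Entry top = std::move(data_.front());
        if (data_.size() > 1) data_.front() = std::move(data_.back());
        data_.pop_back();
        std::size_t i = 0;
        for (;;) {                                     // sift the moved entry down
            std::size_t best = i, l = 2 * i + 1, r = 2 * i + 2;
            if (l < data_.size() && outranks(data_[l], data_[best])) best = l;
            if (r < data_.size() && outranks(data_[r], data_[best])) best = r;
            if (best == i) break;
            std::swap(data_[i], data_[best]);
            i = best;
        }
        return top;
    }

private:
    std::vector<Entry> data_;
};
```

Pushing the five leaves for "Secret" (S, c, r, t with count 1 and e with count 2) and popping repeatedly returns t, r, c, S, e in that order. The vector holds only the live entries, so extra space beyond the nodes themselves stays small.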

Steps for the program:

- Step 1: Priority queue; burn it when the tree is built.
- Step 2: The tree builds the codes into a map; burn the tree when finished.
- Step 3: Map
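If the tree is burned once the code table exists, as step 3 suggests, encoding and decoding can work from the map alone. The following is a minimal sketch under that assumption: `CodeMap`, `encodeWith`, and `decodeWith` are hypothetical helper names, not part of the given `Coder` interface. Because Huffman codes form a prefix code, collecting bits until they match a code is equivalent to walking the tree, and leftover bits at the end correspond to stopping on a compound node.

```cpp
#include <cassert>
#include <map>
#include <string>

// Hypothetical alias: character -> bit string, as produced from the tree.
using CodeMap = std::map<char, std::string>;

// Returns the concatenated codes, or "" if any character has no code.
std::string encodeWith(const CodeMap& codes, const std::string& message) {
    std::string bits;
    for (char c : message) {
        auto it = codes.find(c);
        if (it == codes.end()) return "";   // character not in the tree
        bits += it->second;
    }
    return bits;                            // "" for an empty message
}

// Returns the decoded text, or "" for empty, malformed, or incomplete input.
std::string decodeWith(const CodeMap& codes, const std::string& encoded) {
    std::map<std::string, char> reverse;
    for (const auto& kv : codes) {
        // Codes are non-empty when the sample had two or more distinct characters.
        assert(!kv.second.empty());
        reverse[kv.second] = kv.first;
    }
    std::string out, pending;
    for (char bit : encoded) {
        if (bit != '0' && bit != '1') return "";   // invalid symbol
        pending += bit;
        auto it = reverse.find(pending);
        if (it != reverse.end()) {                 // reached a leaf
            out += it->second;
            pending.clear();                       // back to the root
        }
    }
    return pending.empty() ? out : "";             // leftover bits: ended mid-path
}
```

With the "Secret" table from the assignment (S = 000, c = 001, e = 1, r = 010, t = 011), `encodeWith(codes, "Secret")` yields `00010010101011`, and `decodeWith` on that string returns `Secret`. Rebuilding the reverse map on every call keeps the sketch short; a real class would probably build it once in the constructor.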

Functions given by the professor that can't be changed:

- Coder(string sample_text);
- string encode(string message);
- string decode(string encoded_message);

coder.h

#pragma once
#include <string>
using namespace std;

class Coder {
public:
    /**
     * Constructor; uses the provided sample text as an input string to create
     * a Huffman tree, which is used for encoding and decoding messages. You may
     * assume the given sample text will contain at least two distinct characters.
     *
     * See the lab specs for full details.
     */
    Coder(string sample_text);

    /**
     * Encodes the given string based on the Huffman tree created from the sample text
     * (provided in the constructor of this class).
     *
     * See the lab specs for some details on this method's operations.
     */
    string encode(string message);

    /**
     * Decodes the given string based on the Huffman tree created from the sample text
     * (provided in the constructor of this class).
     *
     * See the lab specs for some details on this method's operations.
     */
    string decode(string encoded_message);
};

