Code in Python 3 7 Perform HAC For this function, we would like you to mimic the behaviour of SciPy's HAC function (Links to an external site ), linkage() You may not use this function in your implementation, but we strongly recommend using it to verify your results Input A collection of m observation vectors in n dimensions may be passed as an m by n array (for us, this will be a list of tuples, not a numpy array like for linkage() ) All elements of the condensed distance matrix must be finite, i e no NaNs or infs In our case, m is the number of Pokemon (here 20) and n is 2 the x and y features for each Pokemon (If invalid data points exit, you need to pop them out In this case, m is the number of valid data points) Using single linkage , perform the hierarchical agglomerative clustering algorithm as detailed on slide 19 of our class slidesLinks to an external site Use a standard Euclidean distance function for calculating the distance between two points Output An ( m 1) by 4 matrix Z At the i th iteration, clusters with indices Z i, 0 and Z i, 1 are combined to form cluster m i A cluster with an index less than m corresponds to one of the m original observations The distance between clusters Z i, 0 and Z i, 1 is given by Z i, 2 The fourth value Z i, 3 represents the number of original observations in the newly formed cluster That is Number each of your starting data points from 0 to m 1 These are their original cluster numbers Create an ( m 1)x4 array or list Iterate through the list row by row For each row, determine which two clusters you will merge and put their numbers into the first and second elements of the row The first point listed should be the smaller of the two cluster indexes The single linkage distance between the two clusters goes into the third element of the row The total number of points in the cluster goes into the fourth element If you merge a cluster containing more than one data point, its number (for the first or second element of the row) is given by m the row index in which the cluster was created Before returning the data structure, convert it into a NumPy matrix If you follow these guidelines for input and output, your result should match the result of scipy cluster hierarchy linkage() and you can use that function to verify your results Be aware that this function does not contain code to filter NaN values, so this filtering should be performed before calling the function Tie Breaking In the event that there are multiple pairs of points with equal distance for the next cluster Given a set of pairs with equal distance (xi, xj) where i j, we prefer the pair with the smallest first cluster index i If there are still ties (xi, xj), (xi, xk) where i is that smallest first index, we prefer the pair with the smallest second cluster index Be aware that this tie breaking strategy may not produce identical results to scipy cluster hierarchy linkage()

The Answer is in the image, click to view ...

Question: Code in Python 3.7 Perform HAC For this function, we would like you to mimic the behaviour of SciPy's HAC function (Links to an external

Code in Python 3.7

Perform HAC

For this function, we would like you to mimic the behaviour of SciPy's HAC function (Links to an external site.), linkage(). You may not use this function in your implementation, but we strongly recommend using it to verify your results!

Input: A collection of m observation vectors in n dimensions may be passed as an m by n array (for us, this will be a list of tuples, not a numpy array like for linkage()!). All elements of the condensed distance matrix must be finite, i.e. no NaNs or infs. In our case, m is the number of Pokemon (here 20) and n is 2: the x and y features for each Pokemon. (If invalid data points exit, you need to pop them out. In this case, m is the number of valid data points)

Using single linkage, perform the hierarchical agglomerative clustering algorithm as detailed on slide 19 of our class slidesLinks to an external site.. Use a standard Euclidean distance function for calculating the distance between two points.

Output: An (m-1) by 4 matrix Z. At the i-th iteration, clusters with indices Z[i, 0] and Z[i, 1] are combined to form cluster m + i. A cluster with an index less than m corresponds to one of the m original observations. The distance between clusters Z[i, 0] and Z[i, 1] is given by Z[i, 2]. The fourth value Z[i, 3] represents the number of original observations in the newly formed cluster.

That is:

Number each of your starting data points from 0 to m-1. These are their original cluster numbers.
Create an (m-1)x4 array or list. Iterate through the list row by row.
For each row, determine which two clusters you will merge and put their numbers into the first and second elements of the row. The first point listed should be the smaller of the two cluster indexes. The single-linkage distance between the two clusters goes into the third element of the row. The total number of points in the cluster goes into the fourth element.
If you merge a cluster containing more than one data point, its number (for the first or second element of the row) is given by m+the row index in which the cluster was created.
Before returning the data structure, convert it into a NumPy matrix.

If you follow these guidelines for input and output, your result should match the result of scipy.cluster.hierarchy.linkage() and you can use that function to verify your results. Be aware that this function does not contain code to filter NaN values, so this filtering should be performed before calling the function.

Tie Breaking

In the event that there are multiple pairs of points with equal distance for the next cluster:

Given a set of pairs with equal distance {(xi, xj)} where i < j, we prefer the pair with the smallest first cluster index i. If there are still ties (xi, xj), ... (xi, xk) where i is that smallest first index, we prefer the pair with the smallest second cluster index.

Be aware that this tie breaking strategy may not produce identical results to scipy.cluster.hierarchy.linkage().

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

Working with FASTA data Modules you can use: sys (Links to an external site.) collections (Links to an external site.) os (Links to an external site.) re (Links to an external site.) argparse (Links...

This assignment asks you to write bash shell scripts to compute matrix operations. The purpose is to get you familiar with the Unix shell, shell programming, Unix utilities, standard input, output,...

Lincoln International Business School (LIBS) LIBS Vision: To provide an innovative, scholarly learning environment based on a commitment to responsible management practices and a global community...

Python programing problem AND HERE'S THE PYTHON CODE FOR THE INVESTING MACHINE THAT PROVIDED Please write down the python code in programming 1026B: Assignment 2: How to Invest Your Money? Due:...

Please give me an example code in Python 3.7. Thank you very much for your help. 11. Write a function named safe input (prompt, type) that works like the Python input function, except that it only...

Listed in the tables are six springs described in customary units and five springs described in SI units. Investigate these squared-and-ground-ended helical compression springs to see if they are...

Last year a Japanese engineering materials corporation, Yamachi Inc., purchased some U.S. Treasury bonds that return an average of 4% per year. Now, Euro bonds are being purchased with a realized...

Accounting Identity is: Assets equivalent Liabilities minus Owners ' Equity.

Seved Help 14 Wisconsin Snowmobile Corp. is considering a switch to level production Cost efficiencies would occur under level production, and aftertax costs would decline by $31,500, but inventory...

Know the core strategies of the Wheel of Loyalty that explain how to develop a loyal customer base.

Understand different leadership styles, the importance of role modeling and focusing the entire organization on the frontline.

Know the difference between service climate and culture, and describe the determinants of a climate for service.