Question: from matplotlib.offsetbox import OffsetImage, AnnotationBbox np.random.seed(0) plt.figure(figsize=(40, 40))

import numpy as np
import matplotlib.pyplot as plt
from matplotlib.offsetbox import OffsetImage, AnnotationBbox

# tsne_results, train_images and subset_size are assumed to be defined in earlier cells
np.random.seed(0)
plt.figure(figsize=(40, 40))

# Scatter plot to help with positioning images
plt.scatter(tsne_results[:, 0], tsne_results[:, 1], alpha=0.5)

# Loop over each image in the subset and plot at the corresponding t-SNE position
for i in range(subset_size):
    # Get the image corresponding to this t-SNE point
    image = train_images[i].reshape(32, 32, 3)
    # Create an OffsetImage object
    imagebox = OffsetImage(image, zoom=0.7)
    # Create the annotation box with the image
    ab = AnnotationBbox(imagebox, (tsne_results[i, 0], tsne_results[i, 1]), frameon=False)
    # Add it to the plot
    plt.gca().add_artist(ab)

# Title and labels
plt.title('t-SNE Visualization with Images')
plt.xlabel('t-SNE Component 1')
plt.ylabel('t-SNE Component 2')

# Show the plot
plt.show()

Q: Based on the last part, how many clusters do you think is good for kMeans? Why?

T: Set up KMeans with some reasonable k based on the previous part (no worries: there is no single best answer).

Tip: to speed things up, import and use MiniBatchKMeans instead of KMeans. Browse the documentation to see how it differs.

[]

. . .
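One possible way to fill in the setup cell is sketched below. It is only an illustration: the random array stands in for the flattened image subset, and `k = 10`, `batch_size=256` and `n_init=3` are assumed values, not a recommended answer. The main difference from KMeans is that MiniBatchKMeans updates the centers from a small random batch at each step instead of the whole dataset, trading a little accuracy for speed.

```python
import numpy as np
from sklearn.cluster import MiniBatchKMeans

# Stand-in for the flattened image subset (assumption: 32x32x3 images, one row each).
rng = np.random.default_rng(0)
X_subset = rng.random((500, 32 * 32 * 3))

# k = 10 is only an illustrative choice; pick yours from the t-SNE plot.
k = 10
kmeans = MiniBatchKMeans(
    n_clusters=k,
    batch_size=256,   # each update step uses a random mini-batch, not all rows
    n_init=3,         # number of random initializations that are tried
    random_state=0,   # fixed seed so reruns give the same clusters
)
```

Nothing is computed yet at this point; the estimator only stores its parameters until it is fitted.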

T: Fit kMeans with your data.

[]

. . .
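A minimal sketch of the fitting step, under the same assumptions as before (random stand-in data, an illustrative `k = 10`). `fit_predict` fits the model and returns one cluster label per row in a single call.

```python
import numpy as np
from sklearn.cluster import MiniBatchKMeans

# Stand-in for the flattened image subset (assumption, not the real data).
rng = np.random.default_rng(0)
X_subset = rng.random((500, 32 * 32 * 3))

kmeans = MiniBatchKMeans(n_clusters=10, batch_size=256, n_init=3, random_state=0)

# Fit the centers and get the cluster index of every image.
labels = kmeans.fit_predict(X_subset)

print(labels.shape)                   # one label per image
print(kmeans.cluster_centers_.shape)  # one center per cluster, in pixel space
```

Because each center lives in the same pixel space as the images, it can later be reshaped to `(32, 32, 3)` and inspected like an image.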

Plot tSNE embedding using clusters' labels

Let's see to what extent the kMeans clusters resemble the structure of the tSNE output.

T: Plot the tSNE embedding again, but this time assign colors corresponding to the kMeans cluster of each image.

Q: Can you see significant groups of points with the same color (label)? (If not, something is wrong.) How many do you see, roughly?

[]

. . .
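The colored plot could look roughly like the sketch below. The embedding and labels here are random stand-ins (so no real groups will appear); in the exercise they would be `tsne_results` and the labels from the fitted kMeans. The `tab10` colormap is an assumption that suits up to 10 clusters.

```python
import numpy as np
import matplotlib.pyplot as plt

rng = np.random.default_rng(0)
tsne_results = rng.normal(size=(500, 2))   # stand-in for the 2-D t-SNE embedding
labels = rng.integers(0, 10, size=500)     # stand-in for the kMeans labels

fig, ax = plt.subplots(figsize=(8, 8))
# c=labels maps each integer label to one color of the colormap.
points = ax.scatter(tsne_results[:, 0], tsne_results[:, 1],
                    c=labels, cmap="tab10", s=10)
ax.set_title("t-SNE embedding colored by kMeans cluster")
ax.set_xlabel("t-SNE Component 1")
ax.set_ylabel("t-SNE Component 2")
fig.colorbar(points, ax=ax, label="cluster label")
```

With more than 10 clusters, a colormap with more distinct colors (for example `tab20`) keeps neighboring labels distinguishable.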

T: Repeat the plot above, but define the color of each point as the mean color of the images in the cluster to which the image belongs.

Hint: You should see some blue and orange parts. Also some almost white and quite dark parts? If yes, good. If you don't see them, something is likely wrong. Maybe too few iterations? If everything is gray, something is very wrong: maybe way too few clusters.

Tune the parameters until happy. Do you see why we wanted to use the faster, approximate version of k-means? Data analysis is often done iteratively/interactively, so efficient algorithms save your time.

[]

. . .
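One way to compute the per-point colors is sketched here with NumPy only. It assumes the images are RGB arrays already scaled to [0, 1] (divide by 255 first if they are stored as 0-255), and the images and labels are random stand-ins.

```python
import numpy as np

rng = np.random.default_rng(0)
n, k = 500, 10
images = rng.random((n, 32, 32, 3))     # stand-in: RGB values in [0, 1]
labels = rng.integers(0, k, size=n)     # stand-in for the kMeans labels

# Mean RGB color of every image: average over height and width -> shape (n, 3).
image_colors = images.mean(axis=(1, 2))

# Mean RGB color of every cluster -> shape (k, 3).
cluster_colors = np.zeros((k, 3))
for c in range(k):
    members = labels == c
    if members.any():                   # empty clusters stay black
        cluster_colors[c] = image_colors[members].mean(axis=0)

# One RGB triple per point: look up the color of the cluster each image belongs to.
point_colors = cluster_colors[labels]
```

The resulting `point_colors` array can be passed directly as `c=point_colors` to `plt.scatter`, which accepts an `(n, 3)` array of RGB values.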

If you're not satisfied with the quality you can tune the parameters some more.

T: If everything looks acceptable, rerun kMeans on the full dataset, something which we couldn't realistically do with tSNE!

[]

. . .
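Rerunning on the full dataset might look like the following sketch. The array `X_full` is a random stand-in (kept small here so the example runs quickly), and the parameter values are assumptions carried over from tuning on the subset. Counting the members of each cluster is a cheap first sanity check.

```python
import numpy as np
from sklearn.cluster import MiniBatchKMeans

rng = np.random.default_rng(0)
X_full = rng.random((2000, 32 * 32 * 3))   # stand-in for all flattened images

kmeans_full = MiniBatchKMeans(n_clusters=10, batch_size=1024, n_init=3, random_state=0)
kmeans_full.fit(X_full)

# labels_ holds the cluster index of every training row after fitting.
full_labels = kmeans_full.labels_

# Number of images per cluster; very small clusters deserve a closer look.
cluster_sizes = np.bincount(full_labels, minlength=10)
print(cluster_sizes)
```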

Let's verify whether the clusters we got on the entire dataset are reasonable.

T: For each cluster center, plot, say, 10 images which are closest to it in the sense of the Euclidean metric.

Q: Looks good? Or maybe you see something suspicious?

For example: if any cluster center looks like a single image in the dataset, you likely chose too many clusters!

[]

. . .
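Finding the closest images can be sketched with plain NumPy as below. The helper `closest_images` is a hypothetical name introduced for illustration, and both arrays are random stand-ins for the flattened images and for `cluster_centers_` of the fitted model.

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.random((300, 32 * 32 * 3))        # stand-in for the flattened images
centers = rng.random((5, 32 * 32 * 3))    # stand-in for the fitted cluster centers

def closest_images(X, centers, n_closest=10):
    """Return an array of shape (k, n_closest) with row indices into X."""
    idx = np.empty((len(centers), n_closest), dtype=int)
    for c, center in enumerate(centers):
        # Euclidean distance of every image to this center.
        dist = np.linalg.norm(X - center, axis=1)
        # Indices of the n_closest smallest distances, nearest first.
        idx[c] = np.argsort(dist)[:n_closest]
    return idx

idx = closest_images(X, centers)
```

For plotting, `X[idx[c]].reshape(-1, 32, 32, 3)` gives the 10 images nearest to center `c`, which can be shown side by side with `imshow` in one row of subplots per cluster.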
