Question: When a user accesses a webpage in their web browser, the webpage typically contains multiple web components, such as images, JavaScript codes, Flash content, CSS

When a user accesses a webpage in their web browser, the webpage typically contains multiple web components, such as images, JavaScript codes, Flash content, CSS

,

etc. These components are often down loaded through additional HTTP

(

)

connections from either the first

-

party domain

(

the website the user is

visiting

)

or from third

-

party domains. This article will focus specifically on JavaScript codes, which are com monly used by ad networks, content distribution networks

(

CDNs

),

tracking services, analytics platforms, and online social networks, including Facebook

s use of them for plugins.

The scenario of web tracking via JavaScript codes is depicted in Figure

1 .

When a user accesses a webpage from a first

-

party domain

(

steps

1 2),

the web browser interprets the HTML tags and executes any JavaScript programs within the HTML script tags. These programs may trigger the browser to send additional requests

to retrieve content from third

-

party domains

(

step

3) .

Depending on their intended functionality, JavaScript programs can be considered either useful

(

functional

),

such as fetching content from a CDN

,

or used for tracking purposes. In the latter case, once the webpage has fully loaded

(

step

4),

the JavaScript codes track the user

s activities on the webpage, read from or write to the cookie database

(

steps

5 6),

and potentially reconstruct user identifiers. Tracking JavaScript programs may also be employed to "fingerprint" the user

s browser and system, and transfer sensitive information to third

-

party domains

(

step

7)

Suppose you are tasked with developing a machine learning model based on a single class

(

.

.,

One

-

Class SVM

(

OCSVM

)

or Positive Unlabeled

(

)

Learning

)

to distinguish between functional and tracking JavaScript codes. I have the database which contain multiple Javascript codes

(

TrackingJS and functionalJS

)

Use Term Frequency

-

Inverse Document Frequency

(

-

IDF

)

to extract features from functional and tracking JavaScript codes.

Develop either One

-

Class SVM or PU Learning, and a baseline SVM for comparison, to classify the JavaScript codes.

Design and conduct experiments to validate and test the efficacy of your developed model:

To report any over

-

or under

-

fitting of the models, you may use

60 %

of the data for testing,

20 %

for validation, and

20 %

for the testing.

Report and discuss the parameters of OCSVM or PU Learning model which give your improved results.

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

When a user accesses a webpage in their web browser, the webpage typically contains multiple web components, such as images, JavaScript codes, Flash content, CSS , etc. These components are often...

Let A, B be sets. Define: (a) the Cartesian product (A B) (b) the set of relations R between A and B (c) the identity relation A on the set A [3 marks] Suppose S, T are relations between A and B, and...

I have to create a program in C and I can't figure it out. The program has to read a source file. Please help. /******************************************************************** PROJECT: Glossary...

Financial Statement Analysis Obtain a copy of the 2009 annual report for a publicly held firm. Try to obtain an annual report for a retail or manufacturing firm. Be sure that the financial statement...

I want you to make a wireframe just like the sample I provided below!! You should include 3 files: index.html, styles.css and main.js. Do the code in jQuery language. Here is the wireframe I want you...

MAKE SURE TO DOUBLE CHECK YOUR ANSWER PLEASE AND THANK YOU Multiple Choice - choose the one alternative that best completes the statement or answers the question. 1. A determines the capability of...

For the three perpetrators in Exercise 5.64, determine the number of possibilities in which a. just one person is convicted. b. exactly two of the three persons are convicted. c. all three persons...

Define the terms total quality management, just-in-time, and reengineering. What do these terms have in common?

8 4 minutes remaining 5 OF 3 5 QUESTIONS REMAINING Question 3 2 All other things the same, a decrease in variable expense per unit will reduce the break - even point. True False Clear selection

Year-to-date, Sum Brands had earned a 3.20 percent return. During the same time period, Raytheon earned 4.33 percent and Coca-Cola earned 0.48 percent. If you have a portfolio made up of 40 percent...

Some have speculated that in addition to increasing the validity of decisions, employing rigorous selection methods has symbolic value for organizations. What message is sent to applicants about the...

Discuss how the following trends are changing the skill requirements for managerial jobs in the United States: (a) increasing use of computers, (b) increasing international competition.

Does it have correct contact information?