Question: ICT 3 7 1 Artificial Intelligence Tutorial 1 1 Week 1 1 : Clustering Twitter data in rapid miner studio Task: Use K - Means

ICT371 Artificial Intelligence
Tutorial 11
Week 11: Clustering Twitter data in rapid miner studio
Task: Use K-Means Clustering to Identify Twitter Topics in RapidMiner
Import the twitter dataset from the week 11 section of the moodle page. After importing, do the following:
Now we have to clean the data and the first step is to convert the data into text and do some text processing. Add Nominal to Text operator into the process. Now use Replace operator to remove any URL in the tweets, you can also remove any @ data as well. We have to use a regular expression to remove the URL which is (http(s)?):VV(www.)?a-zA-Z0-9@:%._|+*#=](2,256}\.[a-z](2,61\bI-a-
ZA-Z0-9@%_\+.~#2&//=*
Parameters
Replace
stribute filer bype v
invert selection
incude special attributes
replace what
Khttps23Ww?a-24.20-3@%_1~8=12.256113-212.6100

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!