Question: Given the following dataset for two classes Non-technical (-1) and technical (1). Technical article refers to an article relating to technology and science and non-technical
Given the following dataset for two classes Non-technical (-1) and technical (1).
Technical article refers to an article relating to technology and science and non-technical refers to other work of fictions. The data has two features: Average time required to read an article, i.e. Time and number of sentences in the article i.e. Sentences (2.2 represents 2.2k sentences or 2200 sentences).
Training is a process of finding the optimal value of coefficients (for our case B1 and B2).
| time(hr) | sentences(1=1000) | Article type |
| 2.7 | 2.5 | 0 |
| 1.4 | 2.3 | 0 |
| 1.3 | 2.4 | 0 |
| 1.3 | 1.8 | 0 |
| 3 | 3 | 0 |
| 7.6 | 2.7 | 1 |
| 5.9 | 2.2 | 1 |
| 6.9 | 1.8 | 1 |
| 8.6 | 3.5 | 1 |
| 7.7 | 3.5 | 1 |
- Plot of the two classes Non-technical (-1) and technical (1) in Excel
- Suggest and plot the hyperplane in Excel.
- Use Weka tool to find the equation and its coefficients to predict the class of new sample whose Time= 1 Hour and Sentences = 2.4 thousand sentences. What is the prediction technical or non-technical article.
- Use google to plot the graph
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
