Question: Code the project in Python. Use a Jupyter notebook and submit a single code file with the .ipynb extension. If your computer's processing capacity is not sufficient, Google Colab is recommended.
Classifying News Texts
The aim is to detect real and fake news content shared through social media posts. In the dataset, "target" indicates whether a news item is real or fake, while "score" is a confidence score for how real the item is, on a scale of 1 to 5. A score of 5 is the highest and means the news is definitely true; a score of 1 is the lowest and means it is definitely fake; a score of 3 means there is no firm verdict either way, i.e. 50 percent fake and 50 percent real.
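Before modeling, it helps to confirm that the two label columns behave as described. Below is a minimal loading sketch; the file name "news.csv" and the column names "text", "target", and "score" are assumptions, not given in the brief, so adjust them to the files actually provided.

```python
# Minimal sketch: load the dataset and check the label columns.
# "news.csv" and the column names are assumptions, not given in the brief.
import pandas as pd

df = pd.read_csv("news.csv")
print(df["target"].value_counts())  # real vs. fake distribution
print(df["score"].describe())       # scores should lie between 1 and 5
```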
* KNN, Decision Tree, Naive Bayes, and SVM classification algorithms will be used. At this stage, binary classification of "target" will be applied (a cross-validation sketch follows this list).
* Linear Regression, KNN Regression, and Decision Tree Regression will be used. In this part, the "score" will be predicted.
* Text pre-processing techniques (stemming, stop-word removal, etc.) together with TF-IDF and n-gram (most popular) approaches will be used to obtain vectors from the text data.
* Feature selection and feature extraction techniques will be used; any algorithm may be chosen for feature selection (FS). A short FS/PCA sketch also follows this list.
* Training and test sets should be formed with 5-fold cross validation, and the final results must be the average of the 5 folds.
* The tables below must be filled in according to the results of the classification and regression algorithms.
* Parameter tuning should be performed for the classification and regression algorithms used, and the best results obtained should be added to the tables.
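The following is a minimal sketch of the TF-IDF + 5-fold cross-validation flow for one of the required classifiers (KNN). It assumes the DataFrame `df` from the loading sketch above; the other classifiers (Naive Bayes, Decision Tree, SVM) slot into `cross_validate` the same way, and hyperparameters such as `n_neighbors=5` are arbitrary starting points for the required parameter tuning, not values from the assignment.

```python
# Minimal sketch: TF-IDF vectors + 5-fold cross-validated KNN classification.
# Assumes `df` with "text" and "target" columns (placeholder names).
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.neighbors import KNeighborsClassifier
from sklearn.model_selection import cross_validate

# For the n-gram runs, add ngram_range=(1, 2) and max_features=... so that
# only the most frequent (i.e. most popular) n-grams are kept.
vectorizer = TfidfVectorizer(stop_words="english", lowercase=True)
X = vectorizer.fit_transform(df["text"])
y = df["target"]

results = cross_validate(
    KNeighborsClassifier(n_neighbors=5),  # swap in NB, DT, or SVM here
    X, y, cv=5,
    scoring=["accuracy", "f1_macro", "precision_macro", "recall_macro"],
)

# The assignment asks for the average of the 5 folds; fit_time and
# score_time give the training/testing times for Table 1.
for metric, values in results.items():
    print(f"{metric}: {values.mean():.4f}")
```

For the feature selection and feature extraction bullet, one common pairing on sparse TF-IDF matrices is chi-squared selection plus TruncatedSVD. TruncatedSVD stands in for PCA here because standard PCA cannot centre a sparse matrix, so treat it as one reasonable substitute rather than the assignment's prescribed method; `k=1000` and `n_components=100` are arbitrary placeholders.

```python
# Minimal sketch: feature selection (chi-squared) and a PCA-style reduction
# on the sparse TF-IDF matrix X from the previous sketch.
from sklearn.feature_selection import SelectKBest, chi2
from sklearn.decomposition import TruncatedSVD

X_fs = SelectKBest(chi2, k=1000).fit_transform(X, y)      # keep the 1000 best terms
X_pca = TruncatedSVD(n_components=100).fit_transform(X)   # 100 latent components
print(X_fs.shape, X_pca.shape)
```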
Table 1: Classification Results

\begin{tabular}{|l|c|c|c|c|c|c|}
\hline
Method & Accuracy & F-measure & Precision & Recall & Number of Features & Training/Testing Time \\ \hline
KNN (TF-IDF) & & & & & & \\ \hline
Naive Bayes (TF-IDF) & & & & & & \\ \hline
Decision Tree (TF-IDF) & & & & & & \\ \hline
KNN (TF-IDF)+FS & & & & & & \\ \hline
Naive Bayes (TF-IDF)+FS & & & & & & \\ \hline
Decision Tree (TF-IDF)+FS & & & & & & \\ \hline
KNN (TF-IDF)+PCA & & & & & & \\ \hline
Naive Bayes (TF-IDF)+PCA & & & & & & \\ \hline
Decision Tree (TF-IDF)+PCA & & & & & & \\ \hline
KNN (n-gram (most popular)) & & & & & & \\ \hline
Naive Bayes (n-gram (most popular)) & & & & & & \\ \hline
Decision Tree (n-gram (most popular)) & & & & & & \\ \hline
KNN (n-gram (most popular))+FS & & & & & & \\ \hline
Naive Bayes (n-gram (most popular))+FS & & & & & & \\ \hline
Decision Tree (n-gram (most popular))+FS & & & & & & \\ \hline
KNN (n-gram (most popular))+PCA & & & & & & \\ \hline
Naive Bayes (n-gram (most popular))+PCA & & & & & & \\ \hline
Decision Tree (n-gram (most popular))+PCA & & & & & & \\ \hline
\end{tabular}

Table 2: Regression Results

\begin{tabular}{|l|c|c|c|c|}
\hline
Method & MAE & RMSE & Number of Features & Training Time / Testing Time \\ \hline
KNN Reg (TF-IDF) & & & & \\ \hline
Naive Bayes Reg (TF-IDF) & & & & \\ \hline
Decision Tree Reg (TF-IDF) & & & & \\ \hline
KNN Reg (TF-IDF)+FS & & & & \\ \hline
Naive Bayes Reg (TF-IDF)+FS & & & & \\ \hline
Decision Tree Reg (TF-IDF)+FS & & & & \\ \hline
KNN Reg (TF-IDF)+PCA & & & & \\ \hline
Naive Bayes Reg (TF-IDF)+PCA & & & & \\ \hline
Decision Tree Reg (TF-IDF)+PCA & & & & \\ \hline
KNN Reg (n-gram (most popular)) & & & & \\ \hline
Naive Bayes Reg (n-gram (most popular)) & & & & \\ \hline
Decision Tree Reg (n-gram (most popular)) & & & & \\ \hline
KNN Reg (n-gram (most popular))+FS & & & & \\ \hline
Naive Bayes Reg (n-gram (most popular))+FS & & & & \\ \hline
Decision Tree Reg (n-gram (most popular))+FS & & & & \\ \hline
KNN Reg (n-gram (most popular))+PCA & & & & \\ \hline
Naive Bayes Reg (n-gram (most popular))+PCA & & & & \\ \hline
Decision Tree Reg (n-gram (most popular))+PCA & & & & \\ \hline
\end{tabular}
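For Table 2, here is a minimal sketch of the 5-fold "score" regression producing the MAE and RMSE the table asks for. It reuses the hypothetical `X` and `df` from the sketches above; Linear Regression and Decision Tree Regression substitute in the same way.

```python
# Minimal sketch: 5-fold cross-validated regression of "score" with MAE/RMSE.
from sklearn.neighbors import KNeighborsRegressor
from sklearn.model_selection import cross_validate

res = cross_validate(
    KNeighborsRegressor(n_neighbors=5),  # or LinearRegression, DecisionTreeRegressor
    X, df["score"], cv=5,
    scoring=["neg_mean_absolute_error", "neg_root_mean_squared_error"],
)

# scikit-learn returns negated errors, so flip the sign for the table.
print("MAE :", -res["test_neg_mean_absolute_error"].mean())
print("RMSE:", -res["test_neg_root_mean_squared_error"].mean())
```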
