Question: Why is my accuracy score so poor? The database is from a JMU Sports forum. I put in pymongo. I dont understand why my accuracy

Why is my accuracy score so poor? The database is from a JMU Sports forum. I put in pymongo. I dont understand why my accuracy is so low. I basically extracted it put it in a list. Made a new list where it takes the top 80 most prolific authors.

from pymongo import MongoClient db = MongoClient().db_name.collection_name users = [] for user in db.find(): users.append(user) print(users[1]) {'_id': 14069840, 'page': 1, 'post_id': 1465, 'post': 'I laughed too hard. Whoops.', 'username': 'Potomac', 'timestamp': '2017-02'} from collections import Counter pr = Counter([review['username'] for review in users]).most_common(80) pr[:1] [('BleedingPurple', 7619)]

keep_ids = {pr[0] : 0 for pr in prolific_reviewers}

keep_reviews = []

for review in reviews:

uid = review['user_id']

if uid in keep_ids and keep_ids[uid] < 500:

keep_reviews.append(review)

keep_ids[uid] += 1

 authors = [review['username'] for review in keep_reviews] text = [review['post'] for review in keep_reviews] from sklearn.feature_extraction.text import TfidfVectorizer from sklearn.svm import LinearSVC from sklearn.model_selection import train_test_split vectorizer = TfidfVectorizer() vectors = vectorizer.fit_transform(texts) print(vectors.shape) (146816, 68140) X_train, X_test, y_train, y_test = train_test_split(vectors, authors, test_size=0.2, random_state=1337) print(X_train.shape, X_test.shape) (117452, 68140) (29364, 68140) svm = LinearSVC() svm.fit(X_train, y_train) predictions = svm.predict(X_test) from sklearn.metrics import accuracy_score print(accuracy_score(y_test, predictions)) 0.18

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

Identify and discuss the benefits of using different types of instructional feedback. Note : You must cite the reference Augmented Feedback How Giving Feedback Influences Learning KEY TERMS absolute...

Providing Quality School-Based Learning and Support Services 239 Chapter 6 Language and literacy support Your core task The core task of almost all TAs is to support students language and literacy...

Jordan 225 23 Uganda HRM Strategic Alignment and Visibility in Uganda John C. Munene and Florence Nansubuga Human resources management is in its infancy in Uganda despite the high demand cre ated by...

rumal of Information Technology Teaching Cases 2011 CNC Mac 2016/1 Teaching case From theme park to resort: customer information management at Port Aventura Mariano A Hervs, Joan Rodon, Marc Planell,...

Please provide summary of the above text (More than 3000 words) **There should be no plagiarism. If you provide wrong solution then i will dislike and report you abusive and spam. Don't waste my...

\fNew research suggests that the most effective executives use a collection o f distinct leadership styles each in the right measure, at just the right time. Such flexibility is tough to put into...

PLEASE READ CAREFULLY THE CASE STUDY PROVIDED AND FEEL FREE TO ADD HERE YOUR COMMENTS FOR EXAMPLE LIKES DISLIKES WORDS OR PHRASES YOU DO NOT UNDERSTAND ANY COMMENTS THAT WILL IMPROVE THE DIALOGUE...

CERTIFICATE IV IN FINANCE AND MORTGAGE BROKING - FN540820 Page 1 UNIT 9 MANAGE PERSONAL AND PROFESSIONAL DEVELOPMENT Unit Code: BSBPEF501 This unit describes the skills and knowledge required to...

For some years now, you've owned a small specialty bookshop in a college town. You sell some textbooks but mainly cater to a broader customer base. Your store always stocks the latest fiction,...

I need a 10 page paper for my MIS class. Please do not copy and paste as my school is getting stricter on plagiarism. I have attached the assignment and the sample \fData Analytic Thinking 1 Data...

College student Jacqueline Loya asked 50 employed students how many times they went out to eat last week. Half of the students had full-time jobs and half had part-time jobs. Full-time: 5, 3, 4, 4,...

The December 31, 2024, adjusted trial balance for Kline Enterprises was as follows: Account Title Accounts payable. Accounts receivable Accumulated depreciation Common stock Cash Cost of goods sold...

"Decoding: Banks - Episode 4 - How does lending work" suggests that financial education will become less important in the future. Question 2 0 options: 1 ) True 2 ) False

Jessie, an unmarried taxpayer using the single filing status, received $16,000 of Social Security retirement benefits this year. Jessie also received $5,000 of interest income and $45,000 of income...

11. How has new technology improved training and development? What are some of the limitations of using iPods or PDAs for training?

10. What are the implications of the aging work force? What strategies should companies consider from a training and development perspective to best utilize older employees and prepare for their...

13. What is the relationship between talent management and employee engagement? What role can training and development practices play in keeping employee engagement high during poor economic times?...