Question: I have written the code for extracting the data but the data is not scrapping into the csv file or excel sheet import scrapy import

I have written the code for extracting the data but the data is not scrapping into the csv file or excel sheet

import scrapy

import pandas as pd

import time

import csv

starttime=time.time()

class ArtisanDataSpider(scrapy.Spider):

name = "artisan_data"

start_urls = ['http://www.handicrafts.nic.in/ArtisanData.aspx?MID=SZmOd%2fCrxTo9CHD2XKF+pA%3d%3d']

def parse(self, response):

# Select the form and fill in the form data

form = response.xpath('//form[@id="form1"]')

form.xpath('.//select[@name="ddlState"]/option[text()="Uttar Pradesh"]/@value').extract_first()

form.xpath('.//select[@name="ddlDistrict"]/option[text()="Sant Ravidas Nagar"]/@value').extract_first()

form.xpath('.//select[@name="ddlDistrict"]/option[text()="Agra"]/@value').extract_first()

form.xpath('.//select[@name="ddlDistrict"]/option[text()="Varanasi"]/@value').extract_first()

yield scrapy.FormRequest.from_response(response, formdata={'ddlState': 'Uttar Pradesh', 'ddlDistrict': ['Sant Ravidas Nagar', 'Agra', 'Varanasi'],'btnSubmit': 'Submit'},

callback=self.parse_result)

def parse_result(self, response):

rows = response.xpath('//table[@id="gvArtisanData"]/tr')

for row in rows:

PEHCHAN_CARD_NO = row.xpath('./td[1]/text()').extract_first()

ARTISIAN_NAME = row.xpath('./td[2]/text()').extract_first()

Father_spouse = row.xpath('./td[3]/text()').extract_first()

Category = row.xpath('./td[4]/text()').extract_first()

AADHARNO = row.xpath('./td[5]/text()').extract_first()

NAME_OF_CRAFT = row.xpath('./td[6]/text()').extract_first()

MOBILENO= row.xpath('./td[7]/text()').extract_first()

VILLAGE= row.xpath('./td[8]/text()').extract_first()

TOWN= row.xpath('./td[9]/text()').extract_first()

CITY = row.xpath('./td[10]/text()').extract_first()

DISTRICT = row.xpath('./td[11]/text()').extract_first()

STATE = row.xpath('./td[12]/text()').extract_first()

yield {'PEHCHAN_CARD_NO':PEHCHAN_CARD_NO, 'ARTISIAN_NAME':ARTISIAN_NAME, 'Father_spouse':Father_spouse, 'Category': Category, 'AADHAR_NO': AADHAR_NO, 'NAME_OF_CRAFT':NAME_OF_CRAFT,'MOBILENO':MOBILENO,'VILLAGE':VILLAGE,'TOWN':TOWN,'CITY':CITY, 'DISTRICT':DISTRICT, 'STATE':STATE}

next_page = response.xpath('//a[text()="Next"]/@href').extract_first()

if next_page:

stripped = (line.strip() for line in scrapy.Request(response.urljoin(next_page), callback=self.parse_result))

lines = (line.split(",") for line in stripped if line)

with open('log.csv', 'w') as out_file:

writer = csv.writer(out_file)

writer.writerow(('title', 'intro'))

writer.writerows(lines)

endtime=time.time()

result=endtime-starttime

print("the time taken is:", result)

I am getting 0 items were scrapped

STATE = row.xpath('./td[12]/text()').extract_first()

next_page = response.xpath('//a[text()="Next"]/@href').extract_first()

stripped = (line.strip() for line in scrapy.Request(response.urljoin(next_page), callback=self.parse_result))

lines = (line.split(",") for line in stripped if line)

with open('log.csv', 'w') as out_file:

writer = csv.writer(out_file)

writer.writerow(('title', 'intro'))

writer.writerows(lines)

endtime=time.time()

result=endtime-starttime

print("the time taken is:", result)

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

below I have written the code for extracting the data from url but the data is not getting in excel sheet and also the time calculation is getting 0 secs import scrapy import pandas as pd import time...

I have written the code for extracting the data but I am unable to get the data into excel sheet do I need to add any forloop for printing the data please check the code below import scrapy import...

Taxi Trip Records in New York City Data Analysis Assignment Introduction In this data analysis assignment, we will explore, clean, analyze, and visualize the Taxi Trip Records in New York City...

Here Below I have written the code for extracting the data from the given url but the data is not saving in the excel sheet so please let me know why the data is not saving in the excel sheet below...

I am looking for my python script to be interpreted, therefore, please address the following items: Define the null and alternative hypothesis in mathematical terms and in words. Report the level of...

In this discussion, you will apply the statistical concepts and techniques covered in this week's reading about one-way analysis of variance (ANOVA). An investment analyst is evaluating the 10-year...

Use the link in the Jupyter Notebook activity to access your Python script. Once you have made your calculations, complete discussion. The script will output answers to the questions given below. You...

Define the null and alternative hypothesis in mathematical terms and in words. Report the level of significance. Include the test statistic and the P-value. See Step 2 in the Python script. Provide...

Step 1: Uploading the dataset The data for this discussion is included in a CSV file called etf_returns.csv. It contains ten-year returns of 30 ETFs for three sectors: financial, energy, and...

You will apply the statistical concepts and techniques covered in this week's reading about one-way analysis of variance (ANOVA). An investment analyst is evaluating the 10-year mean return on...

Consider the Minitab output below. (a) Fill in the missing values. (b) Can the null hypothesis be rejected at the 0.05 level? Why? (c) Use the output and the t-table to find a 99% CI on the...

Describe compliance issues related to health, safety, and security in the workplace. Scenario Your organization has just established its first Health and Safety Committee, and participation in this...

What strategy is a portfolio manager of a mutual fund employing when call options are being sold on stock that s held in the portfolio

Directions: Choose the one alternative that BEST completes the statement or answers the question. Record your responses on the quiz booklet and answer sheet provided. 1-Qatar College requires all...

5. Structure your speech to make it easy to listen to

1. LaunchPad for Real Communication offers key term videos and encourages selfassessment through adaptive quizzing. Go to bedfordstmartins.com/realcomm to get access to: LearningCurve Adaptive...

1. Describe the goals of informative speaking