Question: Building a layout dictionary from 5%_PUMS_record_layout.xls: printing SERIALNO gives (2, 8), but it needs to be (1, 8)

Hi, the output that I'm getting is (2, 8), but I need it to be (1, 8) when I print SERIALNO. I was trying to subtract 1 from the BEG column, but I keep getting errors. Could you provide code for it, or any suggestions? Thanks.

Assignment:

Read the spreadsheet 5%_PUMS_record_layout.xls and create two dictionaries, one for housing and one for persons. The first, named puma5LayoutHousing, has the variable name as its key and, as its value, a tuple holding start-1 and then the end position of the variable in the data record. These correspond to the Python indexes of the character positions of the field. You should then find this useful in parsing the data records when you read the data file. The following literal is an example of what your dictionary should contain: {"SERIALNO":(1,8)}

Important note: your notebook should contain the following:

    print(" HOUSING Data Dictionary ", housing.head())

Spreadsheet (sheet "Housing Unit Record"), as far as it can be read from the screenshot:

    RT  BEG  END  LEN  A/N  VARIABLE  DESCRIPTION
    H   1    1    1    A    RECTYPE   Record Type
    H   2    8    7    A    SERIALNO  Housing/Group Quarters (GQ) Unit Serial Number
    H   9    9    1    A    SAMPLE    Sample Identifier
    H   10   11   2    A    STATE     State Code
    H   12   12   1    A    REGION    Region Code
    H   13   13   1    A    DIVISION  Division Code

(In the screenshot the STATE, REGION and DIVISION rows repeat several times, and the spreadsheet row numbers skip: 2, 4, 6, 8, ...)

My code:

    import pandas as pd
    import numpy as np
    import re

    dataFolder = '/Users/Derek/Documents/LA&S 792/Dataset/documentation' + '/'
    housing = pd.read_excel(dataFolder + "5%_PUMS_record_layout.xls",
                            sheet_name="Housing Unit Record", header=1,
                            usecols=["BEG", "END", "VARIABLE"])
                            #dtype={"BEG":str, "END":str})
    #search for numeric value in BEG
    isIt = [bool(re.search('\d', str(value))) for value in housing.BEG]
    df = pd.DataFrame(housing)
    #df['new_column'] = df['BEG']
    #housing["R"]=int("BEG")-1
    #housing = housing[isit]
    puma5LayoutHousing = {}
    for row in housing.itertuples():
        puma5LayoutHousing[row.VARIABLE] = (row.BEG, row.END)
    print(puma5LayoutHousing["SERIALNO"])
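One possible way to get (1, 8), offered as a sketch rather than the expected solution: int("BEG") fails because it tries to convert the literal text "BEG", not the column, so the subtraction never reaches the data. Converting the column with pd.to_numeric and subtracting 1 from each row's BEG value avoids that. The small DataFrame below is a made-up stand-in for the real sheet (the blank rows and repeated variables are assumptions based on the screenshot), and build_layout and the sample record are hypothetical names and data used only for illustration.

```python
import pandas as pd

# Stand-in for the "Housing Unit Record" sheet (assumed shape): blank
# spacer rows, and some variables repeated on several rows.
housing = pd.DataFrame(
    {
        "BEG": [1, None, 2, None, 9, 10, 10, 12, 12],
        "END": [1, None, 8, None, 9, 11, 11, 12, 12],
        "VARIABLE": ["RECTYPE", None, "SERIALNO", None, "SAMPLE",
                     "STATE", "STATE", "REGION", "REGION"],
    }
)


def build_layout(layout: pd.DataFrame) -> dict:
    """Map VARIABLE -> (BEG - 1, END), so record[start:end] is the field."""
    clean = layout.copy()
    # Coerce to numbers; blanks or stray text become NaN instead of raising.
    clean["BEG"] = pd.to_numeric(clean["BEG"], errors="coerce")
    clean["END"] = pd.to_numeric(clean["END"], errors="coerce")
    # Drop rows that lack a position or a variable name.
    clean = clean.dropna(subset=["BEG", "END", "VARIABLE"])
    # Keep only the first row for each variable.
    clean = clean.drop_duplicates(subset="VARIABLE")
    return {
        row.VARIABLE: (int(row.BEG) - 1, int(row.END))
        for row in clean.itertuples(index=False)
    }


puma5LayoutHousing = build_layout(housing)
print(puma5LayoutHousing["SERIALNO"])  # (1, 8)

# Hypothetical data record: RECTYPE, SERIALNO, SAMPLE, STATE, REGION.
record = "H" + "0000123" + "1" + "06" + "4"
start, end = puma5LayoutHousing["SERIALNO"]
print(record[start:end])  # 0000123
```

With the real spreadsheet, the same function could be applied to the DataFrame returned by pd.read_excel; the persons dictionary would follow the same pattern with the other sheet.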

