Question: Processing a gzip file Now complete the function pull_basic_and_predictor_fields_gzip to repeat the above function but use the gzipped file named: test_4families_annovar.vcf.gz. The file is too

Processing a gzip file Now complete the function pull_basic_and_predictor_fields_gzip to repeat the above function but use the gzipped file named: test_4families_annovar.vcf.gz. The file is too large to unzip and use, so we will use it as is. To read the file one file at a time, use the following code: import gzip with gzip.open(filename,'rt') as fp: for line in fp: pass The output of this function is given in expected_mini_project1_gzip.json. Note that you will have to parse the whole file again, but only pull out the fields used in the pull_basic_and_predictor_fields Save the output as mini_project1_gzip.json -- use indent=2, sort_keys=True

def pull_basic_and_predictor_fields_gzip(filename):

# BEGIN SOLUTION

#YourCodeHere

# END SOLUTION

IT should pass this test:

def test_pull_basic_and_predictor_fields_gzip(self):

import json

filename = 'test_4families_annovar.vcf.gz'

if os.path.exists('mini_project1_gzip.json'):

os.remove('mini_project1_gzip.json')

self.mini_project1.pull_basic_and_predictor_fields_gzip(filename)

expected_result = json.load(open('expected_mini_project1_gzip.json'))

value = json.load(open('mini_project1_gzip.json'))

self.assertEqual(expected_result, value)

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Computer Network Questions!

CANMNMM January of this year. (a) Each item will be held in a record. Describe all the data structures that must refer to these records to implement the required functionality. Describe all the...

A creative engineer suggests structuring the TLB so that not all the bits of the presented address need match to result in a hit. Suggest how this might be achieved, and what might be the costs and...

Planning is one of the most important management functions in any business. A front office managers first step in planning should involve determine the departments goals. Planning also includes...

Doggie Nuggets Inc. (DNI) sells large bags of dog food to warehouse clubs. DNI uses an automatic filling process to fill the bags. Weights of the filled bags are approximately normally distributed...

A fundamental software engineering skill is the design, implementation, and testing of a software component that may be integrated into a larger software product. In order to do this, the software...

School of Computing and Information Technology Session: Spring 2023 University of Wollongong Lecturer: Janusz R. Getta ISIT912 Big Data Management Assignment 2 Published on 21 August 2023 Scope The...

Introduction and learning objectives When you were learning about operational analysis earlier in the term, we talked about jobs that require multiple visits to the CPU (or servers) to receive their...

Assignment 2 Due February 7 Overview In this assignment, you will store more information for the game board. Instead of just a money value, each grid cell on the board will have a tile with a genus,...

I need this solved with all of the steps shown and explained in full detail. If so I will give you the best rating. I have provided makefile below. I need to the multitasking commandar done NOTHING...

Using the results of Problem 66, we can find the electric field at any radius for any spherically symmetrical charge distribution. A solid sphere of charge of radius R has a total charge of q...

A uniform surface charge of density 8.0nC/m2 is distributed over the entire xy plane. What is the electric flux through a spherical Gaussian surface centered on the origin and having a radius of 5.0...

please hel me to Think about the risks that come with international business transactions whether it s currency fluctuations, protecting intellectual property, labor ethics, shipping issues, or...

Design a supercritical Rankin cycle with two reheaters that operates at 30 MPa and 350 C, the reheat pressures are 20% from each previous pressure, and the reheat temperatures are 350 and 320 C...

Design a 1-m-long cylindrical wind tunnel whose diameter is 25 cm operating at a Mach number of 1.8. Atmospheric air enters the wind tunnel through a converging diverging nozzle where it is...

Construct a 99% confidence interval for the difference between the proportion of workers and proportion of mangers who said it was unethical to monitor employee email.

Design a minimum-mass symmetric three-bar truss (the area of member 1 and that of member 3 are the same) to support a load P, as was shown in Fig. 2.9. The following notation may be used: Pu = P cos...

What have you done that shows initiative and willingness to work?

Why do you want this job?

Following a job interview, you should send a thank-you message a . Within two days aft er the interview b. Only if you think you got the job c. Th at follows the AIDA organizational model d. Th at...