Question: Directions Dataset: You will be given two datasets: From the Continuous Glucose Sensor ( CGMData . csv ) From the insulin pump ( InsulinData .

Directions

Dataset:

You will be given two datasets:

From the Continuous Glucose Sensor

(

CGMData

.

csv

)

From the insulin pump

(

InsulinData

.

csv

)

The output of the CGM sensor consists of three columns:

Data time stamp

(

Columns B and C combined

)

the

5 -

minute filtered CGM reading in mg

/

, (

Column AE

)

the ISIG value which is the raw sensor output every

5

mins.

The output of the pump has the following information:

Data time stamp

Basal setting

Micro bolus every

5

mins

Meal intake amount in terms of grams of carbohydrate

Meal bolus

correction bolus

correction factor

CGM calibration or insulin reservoir

-

related alarms

auto mode exit events and unique codes representing reasons

(

Column Q

) .

The bold items are the columns you will use in this project.

Metrics to be extracted:

Percentage time in hyperglycemia

(

CGM

> 180

/

)

Percentage of time in hyperglycemia critical

(

CGM

> 250

/

)

Percentage time in range

(

CGM

> = 70

/

dL and CGM

< = 180

/

)

Percentage time in range secondary

(

CGM

> = 70

/

dL and CGM

< = 150

/

)

Percentage time in hypoglycemia level

1 (

CGM

< 70

/

)

Percentage time in hypoglycemia level

2 (

CGM

< 54

/

)

Each of the metrics mentioned above is extracted in three different time intervals: daytime

(6

am to midnight

),

overnight

(

midnight to

6

),

and whole day

(12

am to

12

) .

The percentage is for the total number of CGM data that should be available each day. Assume that the total number of CGM data that should be available is

288 .

There will be days such that the number of data available is less than

288,

but still, consider the percentage to be with respect to

288 .

You have to extract these metrics for each day and then report the mean value of each metric over all days. Hence there are

18

metrics to be extracted.

The metrics will be computed for two cases:

Case A: Manual mode

Case B: Auto mode

Analysis Procedure:

The data is in reverse order of time. This means that the first row is the end of the data collection whereas the last row is the beginning of the data collection. The data starts with manual mode. Manual mode continues until you get a message

AUTO MODE ACTIVE PLGM OFF

in the column

of the InsulinData.csv

.

From then onwards Auto mode starts. You may get multiple

AUTO MODE ACTIVE PLGM OFF

in column

but only use the earliest one to determine when you switch to auto mode. There is no switching back to manual mode, so the first task is to determine the time stamp when Auto mode starts. Remember that the time stamp of the CGM data is not the same

3

as the timestamp of the insulin pump data because these are two different devices that operate asynchronously.

Once you determine the start of Auto Mode from InsulinData.csv

,

you have to figure out the timestamp in CGMData.csv where Auto Mode starts. This can be done simply by searching for the time stamp nearest to

(

and later than

)

the Auto mode start time stamp obtained from InsulinData.csv

.

For each user, CGM data is first parsed and divided into segments, where each segment corresponds to a day's worth of data. One day is considered to start at

12

am and end at

11

59

.

If there is no CGM data loss, then there should be

288

samples in each segment. The segment as a whole is used to compute the metrics for the whole day time period. Each segment is then divided into two sub

-

segments: the daytime sub

-

segment and the overnight sub

-

segment. For each subsegment, the CGM series is investigated to count the number of samples that belong to the ranges specified in the metrics. To compute the percentage with respect to

24

hours, the total number of samples in the specified range is divided by

288 .

Note that here you have to tackle the

missing data problem

,

so a particular may not have all

288

data points. In the data files, those are represented as NaN. You need to devise a strategy to tackle the missing data problem. Popular strategies include deletion of the entire day of data, or interpolation.

Write a Python script that accepts two CSV files: CGMData.csv and InsulinData.csv and runs the analysis procedure and outputs the metrics discussed in the metrics section in another CSV file using the format described in Result.csv

.

Submission Directions for Project Deliverables

This project will be auto

-

graded. You must complete and submit your work through Ed Lesson

s code challenges to receive credit for the course:

To get started, use the

main

.

'

file provided in your workspace.

All necessary datasets are already loaded into the workspace.

Execute your code by running the

python

3

main.py

command in the terminal to test your work.

On completing your work, submit it for auto

-

grading by clicking the Test button.

You will know you have completed the assignment when feedback appears for each test case with a score.

If needed: to resubmit the assignment in Ed Lesson

Edit your work in

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Finance Questions!

Hello, I am a bit stuck on my assignment this week. I believe I have figured out steps 1-3. I am a bit stuck on 4-6. Any help would be appreciated. " This notebook contains the step-by-step...

. Summary For this project you need to think of an opportunity or challenge your organization or department is currently facing and ask a management question to address it. You will also need to...

MATH 221 Statistics for Decision Making Week 6 iLab Name:_______________________ Statistical Concepts: Data Simulation Confidence Intervals Normal Probabilities Short Answer Writing Assignment All...

Drive (miles) 20 28 54 76 36 88 6 25 40 42 55 71 76 78 4 20 25 29 33 36 36 36 63 73 73 76 80 33 54 63 76 80 80 94 6 State OR NY NV PA TX SC MI CA IL IL OH MI NY CA NY IL GA TX TX MI IL OR MI FL SC NY...

This is Everything they gave us. They want us to write the code: The dataframe for your team is called your_team_df. The variable 'pts' represents the points scored by your team. Calculate and print...

The website for your business has now developed to the point where it has information about the business itself and some more detailed information about its products and/or services, as well as some...

Using PYTHON CODE I need help with step 4. Step 4: Hypothesis Test for the Population Mean (II) A team averaging 110 points is likely to do very well during the regular season. The coach of your team...

Figure 1 : Visual representation of the walking - related activities. Figure 2 Diagram of the data collection process for the dataset ( resulting in six time series for each observation ) . Figure 3...

As stated in directions, must use at least one of the iomanip.h functionalities shown above. Not specified which, thanks! The Game of Life, invented by the mathematician John H. Conway, is intended...

Assignment 3: t Tests and ANOVA This week, you explore key statistical concepts related to data and problem solving through the completion of the following exercises using SPSS and the information...

Selected account balances from the BCooper company at 12/31/20 are as follows: - Accounts Payable 13,200 - Net Income $24,200 - Owner Withdrawls 5,400 - Equipment 32,500 - Owner, Capital...

On July 15, 2011, the Nixon Car Company purchased 1,000 tires from the Harwell Company for $50 each. The terms of the sale were 2/10, n/30. Nixon uses a periodic inventory system and the net method...

Twenty - nine - year old female Constance wished to purchase $ 1 0 0 , 0 0 0 of five - year term life insurance. How much are the premiums $ 8 5 0 $ 1 8 7 $ 1 8 5

Section B 1. Considering the figures shown in the figures below, what should the cash flow be on year 4 if the target internal rate of return (IRR) is 6.12% ? Please explain the concept of IRR and...