Instructions: In assignment 1, I had you provide me some details on the content of a...
Fantastic news! We've Found the answer you've been seeking!
Question:
Transcribed Image Text:
Instructions: In assignment 1, I had you provide me some details on the content of a data set. That assignment was intended to help you get acclimated to the R environment. Here, I want to give you the same opportunity using Python! Your instructions are to create the same outputs that you had using R but instead using Python! As before, I have intentionally left a few minor details for you to either reason through or research. As before, I am also aware that there are lots of ways to accomplish many of these tasks in Python! I do not really care which way you do them so long your output answers the questions asked. NOTE: For future assignments, I will ask that you pick either R or Python to complete your assignment. Expectation: This assignment is constructed such that you can easily validate your answers in Excel with minimum effort. Therefore, I expect you to do so. 1. Load the data file. 2. Provide to me the structure of the loaded data set (viz. something equivalent to the R command str). 3. Provide me a "summary" of the loaded data structure. (viz. something equivalent to the R command summary) 4. Count of releases per year. 5. Count of releases for each group of two years (i.e. 1992 and 1993, 1994 and 1995, etc). 6. Average number of the Lines of Code (LOC) per releases per year. 7. Average size of the file size per year. 8. Create a single data structure through code which contains the year along with the avg, median, and standard deviation for LOC and tar file variables. Instructions: In assignment 1, I had you provide me some details on the content of a data set. That assignment was intended to help you get acclimated to the R environment. Here, I want to give you the same opportunity using Python! Your instructions are to create the same outputs that you had using R but instead using Python! As before, I have intentionally left a few minor details for you to either reason through or research. As before, I am also aware that there are lots of ways to accomplish many of these tasks in Python! I do not really care which way you do them so long your output answers the questions asked. NOTE: For future assignments, I will ask that you pick either R or Python to complete your assignment. Expectation: This assignment is constructed such that you can easily validate your answers in Excel with minimum effort. Therefore, I expect you to do so. 1. Load the data file. 2. Provide to me the structure of the loaded data set (viz. something equivalent to the R command str). 3. Provide me a "summary" of the loaded data structure. (viz. something equivalent to the R command summary) 4. Count of releases per year. 5. Count of releases for each group of two years (i.e. 1992 and 1993, 1994 and 1995, etc). 6. Average number of the Lines of Code (LOC) per releases per year. 7. Average size of the file size per year. 8. Create a single data structure through code which contains the year along with the avg, median, and standard deviation for LOC and tar file variables. Instructions: In assignment 1, I had you provide me some details on the content of a data set. That assignment was intended to help you get acclimated to the R environment. Here, I want to give you the same opportunity using Python! Your instructions are to create the same outputs that you had using R but instead using Python! As before, I have intentionally left a few minor details for you to either reason through or research. As before, I am also aware that there are lots of ways to accomplish many of these tasks in Python! I do not really care which way you do them so long your output answers the questions asked. NOTE: For future assignments, I will ask that you pick either R or Python to complete your assignment. Expectation: This assignment is constructed such that you can easily validate your answers in Excel with minimum effort. Therefore, I expect you to do so. 1. Load the data file. 2. Provide to me the structure of the loaded data set (viz. something equivalent to the R command str). 3. Provide me a "summary" of the loaded data structure. (viz. something equivalent to the R command summary) 4. Count of releases per year. 5. Count of releases for each group of two years (i.e. 1992 and 1993, 1994 and 1995, etc). 6. Average number of the Lines of Code (LOC) per releases per year. 7. Average size of the file size per year. 8. Create a single data structure through code which contains the year along with the avg, median, and standard deviation for LOC and tar file variables. Instructions: In assignment 1, I had you provide me some details on the content of a data set. That assignment was intended to help you get acclimated to the R environment. Here, I want to give you the same opportunity using Python! Your instructions are to create the same outputs that you had using R but instead using Python! As before, I have intentionally left a few minor details for you to either reason through or research. As before, I am also aware that there are lots of ways to accomplish many of these tasks in Python! I do not really care which way you do them so long your output answers the questions asked. NOTE: For future assignments, I will ask that you pick either R or Python to complete your assignment. Expectation: This assignment is constructed such that you can easily validate your answers in Excel with minimum effort. Therefore, I expect you to do so. 1. Load the data file. 2. Provide to me the structure of the loaded data set (viz. something equivalent to the R command str). 3. Provide me a "summary" of the loaded data structure. (viz. something equivalent to the R command summary) 4. Count of releases per year. 5. Count of releases for each group of two years (i.e. 1992 and 1993, 1994 and 1995, etc). 6. Average number of the Lines of Code (LOC) per releases per year. 7. Average size of the file size per year. 8. Create a single data structure through code which contains the year along with the avg, median, and standard deviation for LOC and tar file variables.
Expert Answer:
Answer rating: 100% (QA)
It appears that you have an assignment that requires using Python to perform data analysis tasks that you previously did with R Unfortunately you have... View the full answer
Related Book For
Fundamentals of Case Management Practice Skills for the Human Services
ISBN: 978-1305094765
5th edition
Authors: Nancy Summers
Posted Date:
Students also viewed these programming questions
-
Current the U.S. federal debt is $32.3 trillion and rising: https://www.usdebtclock.org/ What will the consequences be if it continues to increase at the current rate?
-
Planning is one of the most important management functions in any business. A front office managers first step in planning should involve determine the departments goals. Planning also includes...
-
Case Study: Quick Fix Dental Practice Technology requirements Application must be built using Visual Studio 2019 or Visual Studio 2017, professional or enterprise. The community edition is not...
-
Why should the digital divide matter to people who can afford to be fully connected?
-
A uniform electric field of magnitude 325 V/m is directed in the negative y direction in Figure P25.9. The coordinates of point A are (-0.200, -0.300) m, and those of point B are (0.400, 0.500) m....
-
Write Re(e1/z) in terms of x and y. Why is this function harmonic in every domain that does not contain the origin?
-
Laminar flow normally persists on a smooth flat plate until a critical Reynolds number value is reached. However, the flow can be tripped to a turbulent state by adding roughness to the leading edge...
-
The merchandise inventory was destroyed by fire on August 19. The following data were obtained from the accounting records: a. Estimate the cost of the merchandise destroyed. b. Briefly describe the...
-
Distinguish the various methods used to engage and motivate employees for Baby Boomers. Explain these methods in details.
-
1. Are the four intrinsic characteristics the best ones to base the relationship performance measures for Donnell Truong Ventures? If not, what characteristics would be more suitable? In either case,...
-
The base of a triangle is 6 meters longer than the height of the triangle. If the area of the triangle is 108 square meters, what are the base and height of the triangle?
-
A lawyer's fee shall be reasonable. The factors to be considered in determining the reasonableness of a fee include the following: the time and labor required; the novelty and difficulty of the...
-
Required Use the following information to complete William Wyatt's 2021 federal income tax return If information is missing, use reasonable assumptions to fill in the gaps. You may need the following...
-
Indicate if each of the following mechanistic steps is drawn correctly or incorrectly. HC- HC-C- HC- CH OH CH CH CH H -CH OH O-H CH, HC-C- -CH; OH CH3 CH3 HC-C-C-CH; + CH, J H CH3 HC-C-C-CH IL OH CH...
-
Retailers engaged in business in California must register with whom in order to appropriately collect, report, and pay sales tax for operating their businesses?
-
Uncle Ben sees a "help wanted sign" in the window of Aunt Linda's Coffee House. He decides to try apply. Aunt Linda reviews his application and says to Uncle Ben, "I will offer you $15 to run the...
-
The average time for the 200-meter butterfly on the Lake Highlands 5 Swim Team is 29.3 seconds. The slowest time and fastest recorded times varied from the acreage by 5.3 seconds. What absolute value...
-
Outline some of the major problems confronting an international advertiser.
-
A woman, who has been a prostitute, recently discovered she is HIV+. She is currently staying in a shelter where you see her. Several nights she comes in drunk and tells you, Hey, it doesnt hurt as...
-
You are the case manager for a man who has only recently had a first manic episode. He had submitted to treatment, responded well, and returned to work. However, he is currently experiencing another...
-
A man from Vietnam is in your office because his 11-year-old daughter has been having trouble in school. The school suggested the daughter be tested by your agency. You are doing the intake, but only...
-
The following is the trial balance of Sanjay Industries Ltd. as on 31st March 2006. Further information 1.Outstanding rent amounted to 7,200 while outstanding salaries 8,100 at the end of the year....
-
Refer to the case of Monik Traders given in the exercises of the last chapter. Monik Varma now wants to know as to where his firm stands after one month of running of the business. Help him. Towards...
-
The accountant of Pushpa Engineering Company Ltd. has prepared the following trial balance of the company as on 31st March, 2006. Further information 1. Authorised equity share capital of the company...
Study smarter with the SolutionInn App