Question:
Introduction
A common undertaking for a data analyst is to connect an external data collection to an existing dataset. This process involves the exploration of the source and target datasets to merge the new data in a meaningful and logical way.
You will replicate this common process in this task. You will take external data from a comma-separated values (CSV) file and design conceptual, logical, and physical models that describe the data. You will load the new data into an existing database alongside the existing data and run queries across it. The existing tables and CSV data files are also located on this Labs on Demand website.
You must use the rubric to direct the creation of your submission because it provides detailed criteria that will be used to evaluate your work. Each requirement below may be evaluated by more than one rubric aspect. The rubric aspect titles may contain hyperlinks to relevant portions of the course.
EXECUTE THE TASK IN pgAdmin
Perform the following tasks to combine an existing dataset with external data:
A. Provide a question that can be answered using Structured Query Language (SQL) to acquire data from both the original database and the add-on CSV file. The question should require data from both data sources.
Justify how your question provided in part A will be answered.
Identify which data from both the original dataset and the add-on CSV file are needed to answer the question provided in part A, including all tables, columns, and data types.
B. Create an entity relationship diagram (ERD) for the add-on CSV file and any other tables and columns used to answer the question from part A by evaluating the data contained in the file and identifying the 1:M or M:N relationships and relational constraints.
Describe the relationship between the existing table and the table created in part B and discuss any issues with the relationship in the ERD.
Write a SQL statement, in text format, that creates a table for the add-on CSV file based on the ERD and specifies the columns and relevant keys.
Write a SQL statement, in text format, that loads the data from one of the add-on CSV files into the table created in part B.
Note: Do not include SQL statements as a screenshot.
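As a rough illustration of what the two statements in part B might look like in PostgreSQL, here is a minimal sketch. Every name in it is hypothetical and not taken from the task: the existing table `customer` with key `customer_id`, the add-on table `addon_survey`, its columns, and the CSV path are all assumptions to be replaced with whatever your ERD actually specifies.

```sql
-- Hypothetical schema: none of these table or column names come from the task.
-- The foreign key assumes customer.customer_id is a primary key or has a
-- unique constraint; otherwise PostgreSQL rejects the REFERENCES clause.
CREATE TABLE addon_survey (
    survey_id    SERIAL PRIMARY KEY,
    customer_id  TEXT NOT NULL REFERENCES customer (customer_id),
    satisfaction INTEGER CHECK (satisfaction BETWEEN 1 AND 5),
    surveyed_on  DATE
);

-- Load the CSV, skipping its header row. The path is a placeholder.
-- COPY ... FROM reads the file on the database server, so the server
-- process must be able to read it.
COPY addon_survey (customer_id, satisfaction, surveyed_on)
FROM '/path/to/addon_survey.csv'
WITH (FORMAT csv, HEADER true);
```

If the server cannot read the file, psql's `\copy` meta-command accepts the same options and reads the file from the client machine instead.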
C. Write one or more SQL statements, in text format, for a query or queries that answer the question provided in part A.
Note: Do not include SQL statements as a screenshot.
Provide data files that capture the results from the query or queries.
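A query for part C would typically join the existing table to the add-on table on their shared key. The sketch below assumes a hypothetical existing table `customer (customer_id, state)` and a hypothetical add-on table `addon_survey (customer_id, satisfaction)`; neither the names nor the example question ("which states have the highest average satisfaction?") come from the task.

```sql
-- Hypothetical question: average survey satisfaction per customer state.
-- All table and column names are assumptions for illustration only.
SELECT c.state,
       ROUND(AVG(s.satisfaction), 2) AS avg_satisfaction,
       COUNT(*)                      AS responses
FROM customer AS c
JOIN addon_survey AS s
  ON s.customer_id = c.customer_id
GROUP BY c.state
ORDER BY avg_satisfaction DESC;

-- One way to capture the results as a data file (placeholder path,
-- written on the database server).
COPY (
    SELECT c.state,
           ROUND(AVG(s.satisfaction), 2) AS avg_satisfaction,
           COUNT(*)                      AS responses
    FROM customer AS c
    JOIN addon_survey AS s
      ON s.customer_id = c.customer_id
    GROUP BY c.state
    ORDER BY avg_satisfaction DESC
) TO '/path/to/query_results.csv'
WITH (FORMAT csv, HEADER true);
```

The inner join keeps only customers that appear in both sources, which matches the requirement that the question draw on both; a `LEFT JOIN` would be the choice if customers without add-on rows should still be listed.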
D. Identify the specific time period for how often the add-on file should be acquired and refreshed in the database for the data to remain relevant to the business activities and the question from part A.
Explain why the time period identified in part D is relevant to the specific business activities.
E. Explain the whole process, including a demonstration of each of the following:
All code used
The code being executed
The results of all code used in the task
A description of all programs used to complete the task
F. Acknowledge two web sources for any code used to support the application. Be sure the web sources are reliable.
