I am a college student who needs some help with building a data dictionary for my class
Question:
I am a college student who needs some help with building a data dictionary for my class project on Covid-19. I have included a copy of my drafted project proposal below.
The selected datasets share a common theme and that meet the following file structure requirements:
- One csv file.
- One json file.
- One txt file.
I need help building the data dictionary using Python.
The data dictionary should include:
- Field name in original source
- Field name in your database
- data type
- Short description of data field
- Is the field required?
- Will the field accept a NULL value?
My Draft Project Proposal:
Introduction
In the face of the unprecedented challenges posed by the global COVID-19 pandemic, the imperative for robust data management, analysis, and reporting has never been more pronounced. This project responds to this need by harnessing the wealth of information available in datasets related to COVID-19. We aim to establish a comprehensive database solution that addresses the situation's urgency and empowers stakeholders with tools for informed decision-making, insightful analysis, and effective reporting. By integrating diverse datasetsencompassing COVID-19 cases, social media sentiments, and academic research paperswe aspire to forge a dynamic platform that transcends traditional data boundaries.
Project Overview
This project centers around the development of a database infrastructure that seamlessly integrates three diverse datasets related to the COVID-19 pandemic: COVID-19 cases by country in CSV format, COVID-19 vaccination data in JSON format, and the COVID-19 Open Research Dataset in TXT format. The chosen datasets have been carefully selected to provide a holistic understanding of the pandemic's multifaceted aspects. By harmonizing these datasets, we aim to build a unified platform that facilitates comprehensive analysis, data-driven decision-making, and a nuanced exploration of the global COVID-19 landscape. Chosen datasets include:
- Covid-19 cases by Country (CSV)
- Description: This dataset comprehensively overviews COVID-19 cases, deaths, and vaccinations globally. By structuring the data by country and date, it serves as a foundational element for statistical analysis.
- JSON File - COVID-19 Vaccination Data
- Description: This data set captures the pulse of public sentiment and comprises tweets related to COVID-19. The JSON format allows in-depth exploration of individual tweets, user information, and sentiments expressed during the pandemic.
- COVID-19 Open Research Dataset (TXT):
- Description: Focusing on the scholarly dimension, this dataset presents textual data in TXT format, offering a rich source of academic research papers related to COVID-19.
Through these endeavors, we aim to contribute to the collective understanding of COVID-19, empower stakeholders with valuable insights, and foster a resilient data infrastructure ready to address our time's challenges.
References
Kohlmeier, S., Lo, K., Wang, L. L., & Yang, J. (2020, May 7). Covid-19 open research dataset (cord-19). Zenodo. https://zenodo.org/records/3765923
Mathieu, E., Ritchie, H., Rods-Guirao, L., Appel, C., Giattino, C., Hasell, J., Macdonald, B., Dattani, S., Beltekian, D., Ortiz-Ospina, E., & Roser, M. (2020, March 5). Coronavirus pandemic (COVID-19). Our World in Data. https://ourworldindata.org/coronavirus
P., D. K. (2020, August 7). Covid-19 dataset. Kaggle. https://www.kaggle.com/datasets/imdevskp/corona-virus-report
https://www.kaggle.com/datasets/gpreda/covid19-tweets
Rabindra Lamsal. (2020). Coronavirus (COVID-19) Tweets Dataset. IEEE Dataport.https://dx.doi.org/10.21227/781w-ef42
Microeconomics An Intuitive Approach with Calculus
ISBN: 978-0538453257
1st edition
Authors: Thomas Nechyba