Does normalization place any constraint on the storage of data in physical form or on its processing performance?Explain.
What do you understand by domain constraint?
Outline a shortcut to describe relations in 3NF.
Discuss how transitive dependencies in a relation can be removed when it leads to anomalies.
List the three steps to remove transitive dependencies.
What is an enterprise key, and why is it important?
Why is the natural key preserved whenever a surrogate key is created?
What are the benefits of the use of an enterprise key?
Contrast the following terms:a. Scalar aggregate; vector aggregateb. DDL; DMLc. Catalog; schema
What are the primary data integrity constraints in SQL?
How do you determine the order in which the rows in a response to an SQL query appear? What options do you have when specifying this order?
How is the HAVING clause different from the WHERE clause?
State four rules for choosing indexes for a relational database.
How can an SQL command be structured to allow parallel execution?
Use SQL to define the following view:
Write SQL queries to answer the following questions:a. For each course included in the QUALIFIED table, list Course ID and the number of faculty
What are the Tutor IDs for tutors who have not yet tutored anyone?
How many students were matched with someone in the first five months of the year?
List the number of customers living at each state that is included in the Customer_T table.
For each payment made on or after March 10, 2018, list PaymentID, OrderID, PaymentAmount, and the first 10characters of Payment Comment.
What type of an impact has the significant decrease in the cost of storage space had on data warehouse and data mart design?
Describe the mechanism through which prescriptive analytics is dependent on descriptive and predictive analytics.
Research available data quality software. Describe in detail at least one technique employed by one of these tools (e.g., an expert system).
Contrast the following terms:a. Two-tier architecture; three-tier architectureb. Fat client; thin clientc. Optimistic concurrency control;
List the average number of customers per state (including only the states that are included in the Customer_T table).A query can be used as a table
Contrast the following terms:a. Horizontal partitioning; vertical partitioningb. Repository; data dictionaryc. Physical file; tablespaced. Before
Using the Web and Internet resources, search for the application areas of parallel processing. Which DBMS in the market supports this feature?
Denormalization can be a controversial topic among database designers. Some believe that any database should be fully normalized (even using all the
Visit the database designer or administrator of an organization that has heavy transaction-oriented applications and requires frequent updates to
Visit an organization that has implemented a database approach. Evaluate each of the following:a. The organizational placement of data
Visit an organization that has implemented a database approach and interview an MIS department employee who has been involved in disaster recovery
Visit an organization that has implemented a database approach and interview individuals there about the security measures they take routinely.
Contact the DBA of an organization you are familiar with. Interview them to understand how they collect, manage, and utilize metadata. Do they store
Visit the database designer or administrator of an organization that conducts e-business. Ask them if they have ever experienced database downtime.
The text lists a number of cloud service providers that offer traditional data management services like MS SQL database, Amazon RDS, and cloud
Why does an information gap still exist despite the surge in data in most firms?
List the issues that one encounters while achieving a single corporate view of data in a firm.
Explain the need to separate operational and information systems.
Why is it important to consolidate a Web-based customer interaction in a data warehouse?
List the 10 essential rules for dimensional modeling.
What is meant by a corporate information factory?
List the different roles played by data marts and data warehouses in a data warehouse environment.
Explain the components of a star schema with figures and suitable examples.
Explain how the characteristics of data for data warehousing is different from the characteristics of data for operational databases.
Why is real-time data warehousing called active data warehousing?
Discuss the role of an enterprise data model and metadata in the architecture of a data warehouse.
List and describe the various situations in which it becomes essential to further normalize dimension tables.
A table Student stores the values StudentID, name, date of result, and total marks obtained. A student’s information is: StudentID: S876, Name:
Define each of the following terms:a. Hadoopb. MapReducec. HDFSd. NoSQLe. Pig
Contrast the following terms:a. Data lake; data warehouseb. Pig; Hivec. Volume; velocityd. NoSQL; SQL
What are the two challenges faced in visualizing big data?
What is the format that can be used to describe database schema besides JSON?
Compare the JSON and XML representations of a record in Figure 10-1. What is the primary difference between these? Can you identify any advantages of
What is the trade-off one needs to consider while using a NoSQL database management system?
Discuss the features of NoSQL DBMS that ensure high availability but do not guarantee consistency.
How does HDFS aid in coping with hardware failure?
HDase and Cassandra share a common purpose. What is it? What is their relationship to HDFS and Google BigTable?
Use the Internet to browse the features and offerings of Big Data platforms such as haven and Aster. Prepare a report of your findings.
Define each of the following terms:a. Data miningb. Online analytical processingc. Business intelligenced. Predictive analyticse. Apache Spark
Contrast the following terms:a. Data mining; text miningb. ROLAP; MOLAPc. R; Python
Discuss the different types of dashboards and their role in business performance management.
Illustrate the goals of data mining and how they answer fundamental business questions
Discuss why data mining applications are growing rapidly in business.
How is KNIME used as a predictive analytics tool?
How is data quality and management vital in realizing the full potential of big data and analytics?
For each scenario listed below, identify the following: the type of business analytics, the era of BI&A, the goal of data mining (if applicable),
How does Apache Spark differ from Hadoop?
Compare and contrast R and Python as computational environments for analytics.
Briefly describe three types of operations that can easily be performed with OLAP tools.
Using a drawing tool such as Microsoft PowerPoint, design a simple prototype of a top-management dashboard for Fitchwood Insurance Company.
Teradata University Network (TUN) offers access to a tool called SAS Visual Analytics (VA). Use TUN to access SAS VA and complete some of the
Another tool featured on TUN is called Tableau. Tableau offers a student version of its product for free (www.tableausoftware.com/academic/students),
Contrast the following terms:a. Chief data officer; DBAb. Data administration; database administrationc. Open source DBMS; commercial DBMSd. ETL; MDM
Why are data administrators required to maintain an information repository?
What functions require the input and involvement of both the data administrator and the database administrator?
How can the data capture process be improved?
Explain how an organization’s business rules can be checked as part of a data audit.
How can fuzzy logic, pattern matching, and expert systems be used to improve data quality?
Visit some Web sites for open source databases, such as www.postgresql.org and www.mysql.com. What do you see as major differences in administration
The European Union has recently applied the General Protection of Data Regulation, which governs the way firms doing business in the European Union
Visit libereurope.eu/webinars/and write a report on the key current issues effecting data management.
Following on from Question 12-42, visit the European Union’s General Data Protection Regulation Web site and discuss how the new regulation will
Visit an organization that uses an open-source DBMS. Why did the organization choose open-source software? Does it have other open-source software
Explain how TQM techniques can help in improving data quality.
In light of increasing legislation dictating how an organization is to store data, what would be your requirements for the role of chief data officer?
You have been asked to write a brief report on how TQM can be adopted by your organization to improve data quality. Produce a list of reasons why TQM
The mail order firm described in Problem and Exercise 12-33 has about 1 million customers. The firm is planning a mass mailing of its spring sales
The average annual revenue per customer for the mail order firm described in Problems and Exercises 12-33 and 12-35 is $100. The organization is
For each situation presented below, illustrate a document as depicted in Figures 10-4 and 10-5 and specify whether it contains an array, an embedded
Write two HIVE queries, the first to create a PRODUCT table with fields Prod ID, Name, Seller, Price; the second to load data into the table from
Figure 10-14 describes a simple Hadoop architecture. If a real-world system is implemented using this approach, it will suffer from a specific
Assume that the following data regarding Students need to be stored—Name: First Name and Last Name, Roll Number, and Mobile Number Illustrate with
Review Figure 10-5 (a). Write a MongoDB query to display all products with review ratings greater than 3 stars and suppress the fields “height”
Research some tools that perform data scrubbing. What tool would you recommend for the Fitch wood Insurance Company?
Create a star schema for this case study. How did you handle the time dimension?
A pharmaceutical retail store manages its current sales, procurement, and materials availability at the store through Excel sheets. Owing to the
A firm wants to reduce fluid drilling costs substantially by increasing drilling fluid efficiency. Research finds that both fluid drilling speed and
Explain through common examples why determining grain is critical.
Visit an organization that has implemented a database management system and interview concerned individuals about the standards and regulations used
Get In Touch