New Semester
Started
Get
50% OFF
Study Help!
--h --m --s
Claim Now
Question Answers
Textbooks
Find textbooks, questions and answers
Oops, something went wrong!
Change your search query and then try again
S
Books
FREE
Study Help
Expert Questions
Accounting
General Management
Mathematics
Finance
Organizational Behaviour
Law
Physics
Operating System
Management Leadership
Sociology
Programming
Marketing
Database
Computer Network
Economics
Textbooks Solutions
Accounting
Managerial Accounting
Management Leadership
Cost Accounting
Statistics
Business Law
Corporate Finance
Finance
Economics
Auditing
Tutors
Online Tutors
Find a Tutor
Hire a Tutor
Become a Tutor
AI Tutor
AI Study Planner
NEW
Sell Books
Search
Search
Sign In
Register
study help
business
database management systems
Modern Database Management 12th Global Edition Topi Hoffer, Venkataraman - Solutions
Describe the mechanism through which prescriptive analytics is dependent on descriptive and predictive analytics.
Provide examples of how data mining can be applied to call detail records (CDRs) of mobile phone users for the purpose of usage analysis and target marketing.
State at least seven typical data mining applications.
What types of skills are essential for competency in predictive modeling?
HomeMed is a multinational company that specializes in medical equipment for homecare. The company would like to have a dashboard to display statistics captured and monitored by medical devices over a user-selected period of time. What type of analytics is used to fulfil the company’s objective?
OLAP involves three operations: slicing and dicing, data pivoting, and drill-down. Which of these operations involves rotating the view for a particular data point to obtain another perspective?
Describe how data cube is relevant in OLAP in the context of descriptive analytics.
Discuss the impact that the emergence of Internet of Things will have on the need for advanced big data and analytics technologies.
Describe the differences between descriptive, predictive, and prescriptive analytics.
Explain the progression from decision support systems to analytics through business intelligence.
HDase and Cassandra share a common purpose. What is this purpose? What is their relationship to HDFS and Google BigTable?
Explain the core principle of the MapReduce algorithm.
Describe the roles of HDFS in the Hadoop architecture.
Describe and explain the two main components of MapReduce which is a part of the Hadoop architecture.
Why is massively parallel processing very important in the context of big data?
Explain the relationship between Hadoop and MapReduce.
What are the key capabilities of NoSQL that extend what SQL can do?
What is the other format that can be used to describe database schema, besides JSON?
What is the difference between wide-column store and graph-oriented database?
What is the trade-off one needs to consider in using a NoSQL database management system?
What is the difference between explanatory and exploratory goals of data mining?
List the differences between the two categories of technology, Hadoop and NoSQL , which have become core infrastructure elements of big data solutions.
What are the two challenges faced in visualizing big data?
Identify and briefly describe the five Vs that are often used to define big data.
Contrast the following terms:a. Data mining; text miningb. Pig; Hivec. ROLAP; MOLAPd. NoSQL; SQLe. Data lake; data warehouse
Define each of the following terms:a. Hadoopb. MapReducec. HDFSd. NoSQLe. Pigf. data mining g. online analytical processing h. business intelligence
Describe the impact of advances in analytics on data management technologies and practices.
Articulate the differences between descriptive, predictive, and prescriptive analytics.
List the key technology components of a typical Hadoop environment and describe their uses.
Describe the meaning of big data and the demands big data will place on data management technology.
Choose between relational databases and various types of NoSQL databases depending on the organization’s data management needs.
List the main categories of NoSQL database management systems.
Describe the reasons why data management technologies and approaches have expanded beyond relational databases and data warehousing technologies.
Interview a data administrator in an organization that has established a data governance committee and data stewards. Document the different roles provided by the data administrator(s), data stewards, and data governance committee members. What is the charter for the data governance committee? How
Interview data warehouse managers in an organization where you have contacts about their ETL processes.What lessons did you learn from your interviews about the design of sound ETL processes?
Access the resources at Teradata University Network(www.teradatauniversitynetwork.com) for a Webinar or Webcast (produced after 2007) on the topic of data integration or master data management. Prepare a summary of new ideas introduced in that Webcast that expand on the discussion from this chapter.
Master data management and the related specialty customer data integration are rapidly changing disciplines.Find a recent article or book on these topics (or some other specialty area for master data management, such as in health care, operations, or human resources) and prepare a summary of new
Design a questionnaire to conduct a survey to assess the awareness of data quality program amongst database professionals. You may include design statements from the points mentioned in the text.
Look for at least 2 organizations on the Internet which have switched to EDW. Study their cases carefully and report the findings on the problems with traditional storage, data quality program implemented, data integration approach, extract type used, load modes used, cleansing, and transformation
Discuss at least one example each for Algorithmic, Table Lookup, and Multi-field transformation, other than those discussed in the chapter.
Suppose an institute records the marks of students for each semester. Discuss how aggregation and selection transformation can be applied to yield the final merit list and the candidates eligible for scholarship. Make necessary assumptions.
After some further analysis, you discover that the commission field in the Policies table is updated yearly to reflect changes in the annual commission paid to agents on existing policies. Would knowing this information change the way in which you extract and load data into the data mart from the
What types of data transformations might be needed in order to build the Fitchwood data mart?
Research some tools that perform data scrubbing. What tool would you recommend for the Fitchwood Insurance Company?
What types of data pollution/cleansing problems might occur with the Fitchwood OLTP system data?
The OLTP system data for the Fitchwood Insurance Company is in a series of flat files. What process do you envision would be needed in order to extract the data and create the ERD shown in Figure 9-21? How often should the extraction process be performed? Should it be a static extract or an
Describe some field-level and record-level data transformations that often occur during the ETL process for loading a data warehouse.
Discuss different modes of loading EDW and how it can be carried from the staging area to EDW.
Why is data reconciliation technically most challenging part of building data warehouse.
List five errors and inconsistencies that are commonly found in operational data.
Discuss approaches alternate to data integration to consolidate data.
List six typical characteristics of reconciled data.
What is the role of TQM and modern technology in improving data quality?
What are the major differences between the data federation and data propagation forms of data integration?
Is master data management intended to replace data warehousing?
Why is master data management important in an organization?
Discuss Inmon’s recommendation of improving data quality at original data capture stage.
What are the four dimensions along which the impact of poor quality can be measured?
What is data profiling, and what role does it play in a data quality program?
Describe roles and limitations of data steward and chief data officer.
Identify the possible sources in your university which lead to poor quality data.
Define the eight characteristics of quality data.
Explain the effect of the Sarbanes-Oxley Act on the need for organizations to improve data quality.
Why does quality of data have high stakes in today’s environment?
What are the key components of a data governance program? How does data stewardship relate to data governance?
Is the scope for data governance limited to within a firm?What should the data governance program include?
Contrast the following terms:a. static extract; incremental extractb. data scrubbing; data transformationc. consolidation; federationd. ETL; master data management
Define each of the following terms:a. static extractb. incremental extractc. chief data officerd. master data managemente. refresh mode
Explain the various forms of data transformations needed to prepare data for a data warehouse.
Describe the four steps and activities of the Extract, Transform, and Load (ETL)process for data integration for a data warehouse.
Describe the three types of data integration approaches.
Describe the purpose and role of master data management.
Describe a program for improving data quality in organizations, including data stewardship.
Describe the reasons for poor-quality data in organizations.
Define the characteristics of quality data.
Describe the importance of data quality and list several measures to improve quality.
Describe the importance of data governance and identify key goals of a data governance program.
Visit www.teradatauniversitynetwork.com and use the various business intelligence software products available on this site. Compare the different products, based on the types of business intelligence problems for which they are most appropriate. Also, search the content of this Web site for
Visit the following Web sites. Browse these sites for additional information on data warehouse topics, including case examples of warehouse implementations, descriptions of the latest warehouse-related products, and announcements of conferences and other events.a. The Data Warehousing Institute:
Visit an organization that has developed a data warehouse and interview the data administrator or other key participant. Discuss the following issues:a. How satisfied are users with the data warehouse?In what ways has it improved their decision making?b. Does the warehouse employ a three-tier
GROUP BY by itself creates subtotals by category, and the ROLLUP extension to GROUP BY creates even more categories for subtotals. Using all the orders, do a rollup to get total order amounts by product, sales region, and month and all combinations, including a grand total. Display the results
Because data warehouses and even data marts can become very large, it may be sufficient to work with a subset of data for some analyses. Create a sample of orders from 2004 using the SAMPLE SQL command (which is standard SQL); put a randomized allocation of 10 percent of the rows into the sample.
Using the MDIFF “ordered analytical function” in Teradata SQL (see the Functions and Operators manual), show the differences (label the difference CHANGE) in TOTAL(which you calculated in the previous Problem and Exercise) from quarter to quarter. Hint: You will likely create a derived table
Take the query you scrapped from Problem and Exercise 9-45 and modify it to show only the U.S. region grouped by each quarter, not just for 2005 but for all years available, in order by quarter. Label the total orders by quarter with the heading TOTAL and the region ID simply as ID in the result.
The database you are using was developed by MicroStrategy, a leading business intelligence software vendor. The MicroStrategy software is also available on TUN. Most business intelligence tools generate SQL to retrieve the data they need to produce the reports and charts and to run the models users
Review the metadata file for the db_samwh database and the definitions of the database tables. (You can use SHOW TABLE commands to display the DDL for tables.) Are dimension tables conformed in this data mart? Explain.
Review the metadata file for the db_samwh database and the definitions of the database tables. (You can use SHOW TABLE commands to display the DDL for tables.) Explain what dimension data, if any, are maintained to support slowly changing dimensions. If there are slowly changing dimension data, are
Review the metadata file for the db_samwh database and the definitions of the database tables. (You can use SHOW TABLE commands to display the DDL for tables.) Explain the methods used in this database for modeling hierarchies.Are hierarchies modeled as described in this chapter?
Customers may have relationships with one another (e.g., spouses, parents and children). Redesign your answer to Problem and Exercise 9-40 to accommodate these relationships.Problems and Exercises 9-42 through 9-49 deal with the Sales Analysis Module data mart available on Teradata University
Agents change territories over time. If necessary, redesign your answer to Problem and Exercise 9-39 to handle this changing dimensional data.
Would you prefer to normalize (snowflake) the star schema of your answer to Problem and Exercise 9-38? If so, how and why? Redesign the star schema to accommodate your recommended changes.
Create a star schema for this case study. How did you handle the time dimension?
Visit www.teradatauniversitynetwork.com and download the dimensional modeling tool located under the downloadable software section. (Your instructor will have to give you the current password to access this site.) Use this tool to draw your answers to Problems and Exercises 9-28, 9-30, 9-31, and
Visit www.kimballgroup.com and locate Kimball University Design Tip 175. Study this design tip and suggest which database technology would be preferable for warehouses with data in terabytes and above.
A pharmaceutical retail store manages its current sales, procurement and material availability at store through Excel sheets. The store manager, owing to increase in the number of branches in the city, is now finding this process of data maintenance tedious. He is now banking on the idea of
Pick any one organization, such as banks, or those which indulge in e-commerce and identify operational systems and information systems in these organizations. Then based on your understanding, compare the two systems on the basis of their characteristics. Suggest why there was a need to separate
Employees working in IT organizations are assigned different projects for a specific duration, such as a few months or years. The duration is specified by the project start date and end date in the database. The project location is different for each project, so change in employee location also
Simplified Automobile Insurance Company would like to add a Claims dimension to its star schema(see Problem and Exercise 9-30). Attributes of Claim are ClaimID, ClaimDescription, and ClaimType. Attributes of the fact table are now PolicyPremium, Deductible, and MonthlyClaimTotal.a. Extend the star
You are to construct a star schema for Simplified Automobile Insurance Company (see Kimball, 1996b, for a more realistic example). The relevant dimensions, dimension attributes, and dimension sizes are as follows:InsuredParty Attributes: InsuredPartyID and Name. There is an average of two insured
A table Student stores StudentID, name, date of result and total marks obtained. A student’s information is: StudentID:S876, Name: Sabcd, Date of result: 22/12/14, and Total marks obtained: 650. An update transaction has changed the date and total marks obtained to 15/05/15 and 589 respectively.
Showing 200 - 300
of 3225
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
Last
Step by Step Answers