Shared Flashcard Set

Details

ISDS 2001 Test 2
ISDS 2001 Test 2 Catanzaro
35
Computer Science
Undergraduate 3
03/13/2012

Additional Computer Science Flashcards

 


 

Cards

Term
A ___ is a pool of data produced to support decision making enterprise wide.
Definition
Data Warehouse
Term
The four major characteristics of data warehousing are:
Definition
  1. Subject Oriented
  2. Integrated
  3. Time-Variant
  4. Non-Volatile
Term
___ - Characteristic of a DW that says data are organized by topics, such as sales, products, customers, etc. Best for providing a more comprehensive view of the organization; not only how a business is operating, but why.
Definition
Subject Oriented
Term
___ - Characteristic of a DW that says data from different sources are stored in a consistent format. Clarity is obtained in units of measures, naming/labeling of attributes, etc.
Definition
Integrated
Term
___ - Characteristic of a DW where it provides data at various points in time (daily, weekly, monthly, quarterly, annually, historic, etc.) Every data warehouse should have one.
Definition
Time Variant
Term
___ - Characteristic of a DW that says users cannot change the data once entered into the data warehouse. This ensure that the DW is almost exclusively available for access. Obsolete data can be deleted and changes are recorded as new data.
Definition
Non-Volatile
Term

Additional characteristics of a data warehouse may include:

  1. Designed for ___ usage/applications
  2. has __/__ structure
  3. uses __-__ architecture to provide easy access to end user
  4. For newer DWs, allows for ___ access to the data
  5. Contains __ (information that describes what the data is)
Definition

web-based

relational/multidimensional

client-server

real-time

metadata

Term
There are THREE main types of data warehouses
Definition

1. Data Mart

2. Operational Data Stores (ODS)

3. Enterprise Data Warehouses (EDW)

Term
___ - a subset of a data warehouse, usually consisting of a single subject area (marketing, sales, customer satisfaction, inventory, production, etc.)
Definition
Data Mart
Term
 ___ data mart - created directly from the DW. This ensures that the user is viewing/using the same data available to all other users. Its data source is an EDW.
Definition
Dependent Data Mart
Term
__ data mart - a small warehouse designed for a department or strategic business unit whose source is NOT an EDW.
Definition
Independent
Term
_____ - A type of database often used as an interim (or staging) area for a data warehouse, especially for customer information files. Data are updated frequently through the course of business operations as opposed to the static contents of a data warehouse.
Definition
Operational Data Stores (ODS)
Term
__ - a large-scale data warehouse that is used across the enterprise/ company for decision support.  Being large-scaled, it integrates data in standard format from many sources.  (DirecTV, Enterprise Rental use this type of data warehouse).  (It provides data for many types of Decision Support Systems, including CRM, SCM, BPM, BAM, PLM, revenue management, and KMS.)
Definition
Enterprise Data Warehouse (EDW)
Term

Problem: NCR was faced with the challenge to grow into an integrated, solution-oriented business structure with a global focus.

 

Solution?

Definition
Teradata EDW System
Term
___ - Data about data. Describes the contents of a data warehouse, its structure (field names, data type, default value, length) meaning, syntax, and the manner of its use.
Definition
Metadata
Term
The 5 major components of the Data Warehouse Process are:
Definition
  1. Data Sources
  2. ETL (Extraction, Transformation, Load) Process
  3. Comprehensive Database
  4. Metadata
  5. Middleware Tools
Term
__ - transactional data (OLTP) such as CRM and ERP data, web logs from the internet, external data (ex: census data), ACCESS, SQL data (and other formats), legacy systems (reference to outdated computer systems), etc.
Definition
Data Sources
Term
__ - Data are extracted from external data sources using custom ETL software, maintained in a staging area where transformed, cleansed, and integrated, then loaded into the Data Warehouse and/or data marts. (ex: MS SQL Server SSIS)
Definition
ETL (Extraction, Transformation, Load) Process
Term
____ - the EDW used to support all decision analyses.
Definition
Comprehensive Database
Term
___ - are maintained so that it can be used by IT personnel and users; includes software programs with rules for organizing data that can be indexed and searched.
Definition
Metadata
Term
__ - tools that access the contents of the data warehouse. These are the front-end applications that users have to interact with data, including data mining, queries, OLAP, predictive analyses, reporting and visualization tools (ex: MS SQL MS, MS Excel with PowerPivot, and others).
Definition
Middleware Tools
Term
___ - Computer hardware that provides a specific service used by other computers
Definition
Server
Term
___ - a type of server that maintains efficient execution of procedures (programs)
Definition
Application Server
Term
__ - a type of server, sometimes referred to as "the back end", that holds the database or data warehouse.
Definition
Database Server
Term
___ - allows a user to request a server's content or function.
Definition
Client Software
Term

List the 3 components of a 3-Tiered Architecture

 

Advantage: ___

Disadvantage: ___

Definition

Client Workstation

Application Server

Database Server

 

Advantage: separates application and database functions when higher capacity is needed and fast performance is important.

Disadvantage: More hardware = increased costs

Term

List the two components of a two-tiered architecture

 

Advantage:

Disadvantage

Definition

Client Workstation

Application Server AND Database (on same server)

 

Advantage: more economical when the size is not large

Disadvantage: Performance issues when the usage demand is high

Term
The 4 issues considered when deciding on the architecture to use?
Definition
  1. Which DBMS should we use?
  2. Will parallel processing and/or partitioning be used?
  3. Will data migration tools be used to load the DW?
  4. What tools will be used to support data retrieval and analysis?
Term
TeraData Corporation supports the ____ architecture
Definition
Central Data Warehouse
Term
Data integration requires 3 major processes:
Definition

Data Access

Data Federation

Change Capture

Term
List the 5 steps of the ETL Process ("SETIL")
Definition

SELECT

EXTRACT

TRANSFORM

INTEGRATE

LOAD

Term

To ensure a successful DW implementation process, the business must:

a. __

b. __

c. __

d. __

Definition

a. Define the plan (business objectives and strategies)

b. Gather support from managers and end users

c. Set reasonable time frames and budgets

d. Manage expectations

Term

The Inmon Model uses the  ___ approach.

 

The Kimball Model uses the ___ approach.

 

Both ultimately result in an EDW.

Definition

EDW (Top Down)

 

Data Mart Approach

 

Enterprise Data Warehouse

Supporting users have an ad free experience!