Databases and Information Systems
Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 1
Life Without Databases: Lists
• Lists are often sufficient for simple tasks
• Not appropriate for complex information
• Multiple lists lead to
– Data redundancy
– Data inconsistency
– Duplicate data
– Sorting issues
– Incomplete data
Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 2
Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 3
Databases
• Collections of related data
• Easily stored, sorted, organized, and queried
• Turn data into information
Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 4
Advantages of Using Databases
• Store and retrieve large quantities of information
• Enable information sharing
• Provide data centralization
• Promote data integrity
• Allow for flexible use of data.
Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 5
Copyright © 2010 Pearson Education, Inc.
Copyright © 2011 Pearson Education, Inc. Publishing as Prentice HallPublishing as Prentice Hall 6
Disadvantages of Databases
• Complex to construct
• Time consuming
• Expensive
• Privacy concerns
Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 7
Database Terminology
• Field: A category of information, displayed in columns
• Record: A group of related fields
Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 8
Copyright © 2010 Pearson Education, Inc.
Copyright © 2011 Pearson Education, Inc. Publishing as Prentice HallPublishing as Prentice Hall 9
Database Terminology
• Data type: Type of data that can be stored in a field
Data Type Used to Store Example of Data Stored in the Field
Text Alphabetic or alphanumeric data Mary, CIS110
Numeric Numbers 256, 1.347, $5600
Computational Computational formulas Credit hours x per-credit tuition charges
Date Dates in standard date notation 4/15/2012
Memo Long blocks of text Four score and seven years ago our
fathers brought forth on this continent a
new nation, conceived in liberty, and
dedicated to the proposition that all men
are created equal.
Object Multimedia files or documents MP3 file, AVI file
Hyperlink A hyperlink to a Web page on the www.pearsonhighered.com/techinaction
Internet
Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 10
Database Terminology
• Table: A group of related records
• Primary key: A field value unique to a record
Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 11
Database Types
• Relational databases
– Organize data in tables
– Link tables to each other through their primary
keys
• Object-oriented databases
– Store data in objects
– Also store methods for processing data
– Handle unstructured data
Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 12
Database Types
• Multidimensional databases
– Store data in multiple dimensions
– Organize data in a cube format
– Can easily be customized
– Process data much faster
Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 13
Database Management Systems (DBMS)
• Application software designed to capture
and analyze data
• Four main operations of a DBMS:
– Creating databases and entering data
– Viewing and sorting data
– Extracting data
– Outputting data
Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 14
Creating Databases and
Entering Data
• Create field
names
– Identify each
type
of data
– Data
dictionary (or
database
schema)
Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 15
Creating Databases and
Entering Data
• Create
individual
records
– Key in
– Import
– Input form
Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 16
Data Validation
• Validation
– Process of ensuring that data entered into
the database is correct (or at least
reasonable) and complete
• Validation rules
– Range checks
– Completeness checks
– Consistency checks
– Alphabetic/numeric checks
Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 17
Data Validation
• Example of a completeness check
Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 18
Viewing and Sorting Data
• Browse
through
records
• Sort records
by field name
Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 19
Extracting or Querying Data
• Query
– A question or
inquiry
– Provides
records based
on criteria
– Structured
Query
Language
(SQL)
Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 20
Structured Query Language
• Used to extract records from databases
• Original version developed in mid-1970s
and called SEQUEL
• SQL was introduced as commercial
product by Oracle in 1979.
• Uses relational algebra to extract data
Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 21
Outputting Data
• Reports
– Printed (or electronic) output
– Summary data reports
• Export data
Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 22
Relational Database Operations
• Relational
databases
organize data
into tables
• Relationships
are links
between tables
with related data
• Common field(s)
need to exist
between tables
Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 23
Types of Relationships
• One-to-one
– For each record in a table, only one corresponding record in a
related table
• One-to-many
– Only one instance of a record in one table; many instances in a
related table
• Many-to-many
– Records in one table related to many records in another
Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 24
Relational Database Operations
• Normalization of data (recording data
once) reduces data redundancy
• Foreign key: The primary key of one table
is included in another to establish
relationships with that other table
Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 25
Data Storage
• Data warehouse
– Large-scale
repository of data
– Organizes all the
data related to an
organization
– Data organized
by subject
Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 26
Populating Data Warehouses
• Source data
– Internal sources
• Company databases, etc.
– External sources
• Suppliers, vendors, etc.
– Customers or Web site visitors
• Clickstream data
Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 27
Data Staging
• Data staging
– Extract data from source
– Reformat the data
– Store the data
• Software programs and procedures
created to extract the data and reformat
it for storage
Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 28
Data Marts
• Small slices of data
• Data for a single department or for use
by specific employee groups
Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 29
Data Warehouse Process
Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 30
Managing Data:
Information Systems
• Information systems
– Software-based solutions used to gather and
analyze information
• Functions performed by information
systems include
– Acquiring data
– Processing data into information
– Storing data
– Providing output options
Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 31
Transaction Processing Systems (TPSs)
• Keep track of
everyday
business
activities
• Batch
processing
• Real-time
processing
Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 32
Management Information Systems (MISs)
• Provide timely and accurate information for
managers in making business decisions
• Detail report:
– Transactions that
occur during a
period of time
• Summary report:
– Consolidated
detailed data
• Exception report:
– Unusual
Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 33
Decision Support Systems (DSSs)
• Help managers develop solutions for
specific problems
Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 34
Model Management Systems
• Software that assists in building management models in
DSSs
• Can be built to describe any business situation
• Typically contain financial and statistical analysis tools
Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 35
Knowledge-Based Systems
• Expert system: Replicates human experts
• Natural language processing (NLP) system:
Enables users to communicate with
computers using a natural spoken or written
language
• Artificial intelligence (AI): Branch of
computer science that deals with attempting
to create computers that think like humans
• Support concept of fuzzy logic
Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 36
Data Mining
• Process by which great amounts of data
are analyzed and investigated
• Objective is to spot patterns or trends
within the data
Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 37
Data Mining Methods
• Classification
– Define data classes
• Estimation
– Assign a value to data
• Affinity grouping or association rules
– Determine which data goes together
• Clustering
– Organize data into subgroups
• Description and visualization
– Get a clear picture of what is happening
Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 38
Data Ethics
• Is data private any more?
• Daily life is recorded in many disparate
databases
– Credit card transactions
– Banking transactions
– Frequent buyer cards
– Toll records
– Prescription history and medical records
• Data convergence
Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 39
Protecting Your Data
• What can you do? Ask the following questions:
– For what purpose is the data being gathered?
– Are the reasons for gathering the data legitimate or
important to you?
– How will the information gathered be protected once it
has been obtained?
– Will the information collected be used for purposes
other than those for which it was originally collected?
– Could the information asked for be used for identity
theft?
– Are organizations that already have your data
safeguarding it?
Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 40