Tribhuvan University
SARASWATI MULTIPLE CAMPUS
Department of BCA
Assignment No: 4
Submitted by Submitted To
Name: Amit Neupane Deepak Poudel
Roll no: 21
Contents
1. Database Management System (DBMS):.............................................................................................3
a. Characteristics of Database:..........................................................................................................3
b. Functions of DBMS:.......................................................................................................................3
2. Database Models:................................................................................................................................4
a. ER Model:......................................................................................................................................4
b. HIERARCICAL MODEL:...................................................................................................................4
c. Network Model:............................................................................................................................5
3. Database languages:............................................................................................................................5
a. DDL................................................................................................................................................5
b. DML...............................................................................................................................................5
4. Data Warehouse:.................................................................................................................................5
a. Components of data warehouse are:............................................................................................6
i. Central database:....................................................................................................................6
ii. Data integration:.....................................................................................................................6
iii. Metadata:...............................................................................................................................6
iv. Data warehouse access tools:.................................................................................................6
5. Data Mining.........................................................................................................................................6
a. KDD Process..................................................................................................................................6
i. Developing an understanding of.............................................................................................7
ii. Creating a target data set:......................................................................................................7
iii. Data cleaning and preprocessing............................................................................................7
iv. Data reduction and projection................................................................................................7
v. Choosing the data mining task................................................................................................7
vi. Choosing the data mining algorithm.......................................................................................7
vii. Data mining............................................................................................................................7
viii. Interpreting mined patterns...................................................................................................8
ix. Consolidating discovered knowledge.....................................................................................8
6. DBA......................................................................................................................................................8
a. Responsibilities OF DBA:................................................................................................................8
1. Database Management System (DBMS):
A database management system (DBMS) is system software for creating and
managing databases. A DBMS makes it possible for end users to create, protect, read,
update and delete data in a database. A DBMS generally manipulates the data itself, the
data format, field names, record structure and file structure. It also defines rules to
validate and manipulate this data.
a. Characteristics of Database:
Self-describing nature of a database system.
Insulation between programs and data, and data abstraction.
Support of multiple views of the data.
Sharing of data and multiuser transaction processing.
Restriction of unauthorized access
Backup and recovery facilities.
b. Functions of DBMS:
o Stores the definitions of data and their relationships (metadata) in a data
dictionary, any changes made are automatically recorded in the data dictionary.
o Creates the complex structures required for data storage.
o Transforms entered data to conform to the data structures.
o Creates a security system and enforces security within that system.
o Multi user Access Control
o Performs backup and data recovery procedures to ensure data safety.
o Data Integrity Management.
o Provides access to the data via utility programs and from programming languages
interfaces.
2. Database Models:
a. ER Model:
ER model stands for an Entity-Relationship model. It is a high-level data model.
This model is used to define the data elements and relationship for a specified system.
The ER Model represents real-world entities and the relationships between them. ER
Modeling helps you to analyze data requirements systematically to produce a well-
designed database. So, it is considered a best practice to complete ER modeling before
implementing your database.
For example, suppose we design a school database. In this database, the student
will be an entity with attributes like address, name, id, age, etc. The address can be
another entity with attributes like city, street name, pin code, etc. and there will be a
relationship between them.
b. HIERARCICAL MODEL:
A hierarchical database is a data model in which data is stored in the form
of records and organized into a tree-like structure, or parent-child structure, in
which one parent node can have many child nodes connected through links.
c. Network Model:
Network database management systems (Network DBMSs) are based on a
network data model that allows each record to have multiple parents and multiple child
records. A network database allows flexible relationship model between entities.
simple network database
3. Database languages:
a. DDL
DDL is Data Definition Language which is used to define data structures. For
example: create table, alter table are instructions in SQL. Basic command present in DDL
is CREATE, DROP, RENAME, ALTER etc.
b. DML
DML is Data Manipulation Language which is used to manipulate data itself. For
example: insert, update, delete are instructions in SQL. It is used to add, retrieve or
update the data. BASIC command present in DML is UPDATE, INSERT, MERGE etc. It
is further classified into Procedural and Non-Procedural DML.
4. Data Warehouse:
A data warehouse is a large collection of business data used to help an
organization make decisions. The concept of the data warehouse has existed since the
1980s, when it was developed to help transition data from merely powering operations to
fueling decision support systems that reveal business intelligence. The large amount of
data in data warehouses comes from different places such as internal applications such as
marketing, sales, and finance; customer-facing apps; and external partner systems, among
others.
a. Components of data warehouse are:
i. Central database:
A database serves as the foundation of your data warehouse. Traditionally, these
have been standard relational databases running on premise or in the cloud.
ii. Data integration:
Data is pulled from source systems and modified to align the information for
rapid analytical consumption using a variety of data integration approaches such as ETL
(extract, transform, load) and ELT as well as real-time data replication, bulk-load
processing, data transformation, and data quality and enrichment services.
iii. Metadata:
Metadata is data about your data. It specifies the source, usage, values, and other
features of the data sets in your data warehouse. There is business metadata, which adds
context to your data, and technical metadata, which describes how to access data –
including where it resides and how it is structured.
iv. Data warehouse access tools:
Access tools allow users to interact with the data in your data
warehouse. Examples of access tools include: query and reporting
tools, application development tools, data mining tools, and OLAP
tools.
5. Data Mining
Data mining is a process used by companies to turn raw data into useful
information. By using software to look for patterns in large batches of data, businesses
can learn more about their customers to develop more effective marketing strategies,
increase sales and decrease costs. Data mining depends on effective data collection,
warehousing, and computer processing.
a. KDD Process
i. Developing an understanding of
the application domain
the relevant prior knowledge
the goals of the end-user
ii. Creating a target data set:
selecting a data set, or focusing on a subset of variables, or data
samples, on which discovery is to be performed.
iii. Data cleaning and preprocessing.
Removal of noise or outliers.
Collecting necessary information to model or account for
noise.
Strategies for handling missing data fields.
Accounting for time sequence information and known
changes.
iv. Data reduction and projection.
Finding useful features to represent the data depending on the goal
of the task.
Using dimensionality reduction or transformation methods to
reduce the effective number of variables under consideration or to
find invariant representations for the data.
v. Choosing the data mining task.
Deciding whether the goal of the KDD process is
classification, regression, clustering, etc.
vi. Choosing the data mining algorithm.
Selecting method(s) to be used for searching for patterns in
the data.
Deciding which models and parameters may be appropriate.
Matching a particular data mining method with the overall
criteria of the KDD process.
vii. Data mining.
Searching for patterns of interest in a particular
representational form or a set of such representations as
classification rules or trees, regression, clustering, and so
forth.
viii. Interpreting mined patterns.
ix. Consolidating discovered knowledge.
6. DBA
A database administrator, frequently known just by the acronym DBA, is a role
usually within the Information Technology department, charged with the creation,
maintenance, backups, querying, tuning, user rights assignment and security of an
organization's databases.
a. Responsibilities OF DBA:
i. establish the needs of users and monitor user access and security
ii. monitor performance and manage parameters in order to provide fast
responses to front-end users
iii. map out the conceptual design for a planned database
iv. consider both back-end organization of data and front-end
accessibility for end-users
v. refine the logical design so that it can be translated into a specific
data model
vi. further refining the physical design to meet system storage
requirements
vii. install and test new versions of the database management system
(DBMS)
viii. maintain data standards, including adherence to the Data Protection
Act
ix. carry out capacity planning