0% found this document useful (0 votes)

37 views6 pages

Unit 15

The document discusses different types of file organization structures including sequential files, direct files, indexed sequential files, and indexed files. It provides details on sequential file organization, including its structure, operations of insertion, deletion, updating and retrieval. Some disadvantages of sequential file organization are that updates are not easily accommodated, random access is not possible, and adding new fields requires rewriting every record.

Uploaded by

Saddam

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

37 views6 pages

Unit 15

Uploaded by

Saddam

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

UNIT 15 FILES

Structure Page No.

15.1 Introduction 87
Objectives
15.2 Terminology 87
15.3 File Organisation 88
15.4 Sequential Files 89
Structure
Operations
Disadvantages
Areas of Use
15.5 Direct File Organisation 90
15.6 Indexed Sequential File Organisation 91
15.7 Summary 91
15.8 Solutions/Answers 92

15.1 INTRODUCTION

In this Unit, we will discuss storage data in the computers. The data is stored in computers in the
form of files. We introduce you the basic terminology related to file organisation in Sec. 15.2. In
Sec. 15.3, we discuss various ways in which data is organised in files. In Sec. 15.4 to Sec 15.6,
we will discuss various kinds of file organisations and their advantages and disadvantages. It
will be useful if you recapitulate the units of Block 2 to refresh your knowledge of syntax
related to file handling in C and also Unit 14 of this block on Tree structures.

Objectives
After studying this unit, you should be able to
• define the various terms related to files;
• describe the different ways in which data is organised in files; and
• discuss the advantages and disadvantages types of files.

15.2 TERMINOLOGY

We will now define the terms of the hierarchical structure of data collection stored in computers.

1) Field: It is an elementary data item characterised by its size, length and type.
For example:
Name : a character type of size 10
Age: a numeric type
2) Record: It is a collection of related fields that can be treated as a unit from an applications
point of view.
For example:
A university could use a student record with the fields, university enrolment no., Name
Major subjects
3) File: Data is organised for storage in files. A file is a collection of similar, related records.
It has an identifying name.
For example: “STUDENT” could be a file consisting of student records for all the pupils in
a university. 87
Data Structures 4) Index: An index file corresponds to a data file. It’s records contain a key field and a
Pointer to that record of the data file which has the same value of the key field.
Indexing will be discussed in detail later in the unit.

The data stored in files is accessed by software which can be divided into the following two
categories:
1) User Programs: These are usually written by a Programmer to manipulate retrieved data
in the manner required by the application.
2) File Operations: These deal with the physical movement of data in and out of files. User
programs effectively use file operations through appropriate programming language
syntax. The File Management system manages the independent files and acts as the
software interface between the user programs and the file operations.
File operations can be categorised as-
i) CREATION of the file
ii) INSERTION of records in the file
iii) UPDATION of previously inserted records
iv) RETRIEVAL of previously inserted records
v) DELETION of records
vi) DELETION of the file.

15.3 FILE ORGANISATION

File organisation can most simply be defined as the method of storing Data record in a file and
the subsequent implications on the way these records can be accessed. The factors involved in
selecting a particular file organisation for uses are:

• Ease of retrieval

• Convenience of updates

• Economy of storage

• Reliability

• Security

• Integrity

Different file organisations accord the above factors differing weightages. The choice must be
made depending upon the individual needs of the particular application in question.

We now introduce in brief the various commonly encountered file organisations.

1) Sequential Files
Data records are stored in some specific sequence e.g. order of arrival value of key field
etc. Records of a sequential file cannot be accessed at random i.e. to access the nth record,
one must traverse the preceding (n − 1) records. Sequential files will be dealt with at length
in the next section.
2) Relative Files
Each data record has a fixed place in a relative file. Each record must have associated with
it in integer key value that will help identify this slot. This key, therefore, will be used for
insertion and retrieval of the record. Random as well as sequential access is possible.
88 Relative files can exist only on random access devices like disks.
3) Direct Files Files
These are similar to relative files, except that the key value need not be an integer. The user
can specify keys which make sense to her application.
4) Indexed Sequential Files
An index is added to the sequential file to provide random access. An overflow area needs
to be maintained to permit insertion in sequence.
5) Indexed Files
In this file organisation, no sequence is imposed on the storage of records in the data file,
therefore, no overflow area is needed. The index however, is maintained in strict sequence.
Multiple indexes are allowed on a file to improve access.

15.4 SEQUENTIAL FILES

We will now discuss in detail the sequential file organisation as defined in Sec. 15.2. Sequential
files have data records stored in a specific sequence.

A sequentially organised file may be stored on either a serial-access or a direct-access storage

medium

15.4.1 Structure

To provide the “sequence” required a “key” must be defined for the data records. Usually a field
whose values can uniquely identify data records is selected as the key. If a single field cannot
fulfil this criterion, then a combination of fields can serve as the key. For example in a file,
which keeps student records, a key could be student no.

15.4.2 Operations
1) Insertion: Records must be inserted at the place dictated by the sequence of the keys. As
is obvious, direct insertions into the main data file would lead to frequent rebuilding of the
file. This problem could be mitigated by reserving overflow areas in the file for insertions.
But this leads to wastage of space and also the overflow areas may also be filled.
The common method is to use transaction logging. This works as follows:
i) collect records for insertion in a transaction file in their order of arrival.
ii) when population of the transactions file has ceased, sort the transaction file in the
order of the key of the primary data file.
iii) merge the two files on the basis of the key to get a new copy of the primary sequential
file.
Such insertions are usually done in a batch mode when the activity/program, which
populates the transaction file, have ceased. The structure of the transaction files records
will be identical to that of the primary file.
2) Deletion: Deletion is the reverse process of insertion. The space occupied by the record
should be freed for use. Usually deletion (like-insertion) is not done immediately. The
concerned record is written to a transaction file. At the time of merging the corresponding
data record will be dropped from the primary data file.
3) Updation:Updation is a combination of insertion and deletions. The record with the new
value is inserted and the earlier version deleted. This is also done using transaction files.
4) Retrieval: User programs will often retrieve data for viewing prior to making decisions,
therefore, it is vital that this data reflects the latest state of the data if the merging activity
has not yet taken place.

Retrieval is usually done for a particular value of the key field. Before return in to the user, the
data record should be merged with the transaction record (if any) for that key value. 89
Data Structures The other two operations “creation” and “deletion” of files are achieved by simple programming
language statements.

15.4.3 Disadvantages

Following are some of the disadvantages of sequential file organisation:

• Updates are not easily accommodated

• By definition, random access is not possible

• All records must be structurally identical. If a new field has to be added, then every
record must be rewritten to provide space for the new field.

• Continuous areas may not be possible because both the primary data file and the
transaction file must be looked during merging.

15.4.4 Areas of Use

Sequential files are most frequently used in commercial batch oriented data processing where
there is the concept of a master file to which details are added periodically. Examples of this are
payroll applications.

E6) Describe the record structure to be used for the lending section of a library.

E7) Write a program in ‘C’ language to insert the following records into a file ‘PERSONAL’
- Adam Bede, 47, Engineer - Silas Marner, 50, Doctor
Use a name field of size 20, age field of size 2 and profession field of size 20.

E8) Merge the following, sequenced on NO:

Transactions Master
No. Name No. Name
6 Beta 1 Delta
4 Alpha 2 Lambda
7 Gamma 8 Phi

15.5 DIRECT FILE ORGANISATION

It offers an effective way to organise data when there, is a need to access individual records
directly.

To access a record directly (or random access) a relationship is used to translate the key value
into a physical address. This is called the mapping function R R.(key value)– Address

Direct files are stored on DASD (Direct Access Storage Device)

A calculation if performed on the key value to get an address. This address calculation
technique is often termed as hashing. The calculation applied is called a hash function.

Here we discuss a very commonly used hash function called Division - Remainder

Division-Remainder Hashing

According to this method, key value is divided by an appropriate number, generally a prime
90 number, and the division of remainder is used as the address for the record.
The choice of appropriate divisor may not be so simple. If it is known that the file is to contain n Files
records, then we must, assuming that only one record can be stored a given address, have
divisor n.

Also we may have a very large key space as compared to the address space. Key space refers to
all the possible key values. The address space possibly may not match the actual number of key
values in the file, the size of key space, therefore a one to one mapping may not be there. That is
calculated address may not be unique. It is called Collision, i.e.

R(K1) = R(K2)butK1 6= K2

Two unequal keys have been calculated to have the same address. The keys are called
synonyms.

There are various approaches to handle the problem of collisions. One of these is to hash to
buckets. A bucket is a space that can accommodate multiple records. A discussion on buckets
and other such methods to handle collisions is out of the scope of this Unit.

15.6 INDEXED SEQUENTIAL FILE ORGANISATION

When there is need to access records sequentially by some key value and also to access records
directly by the same key value, the collection of records may be organised in an effective
manned called Indexes Sequential Organisation.

You must be familiar with search process for a word in a language dictionary. The data in the
dictionary is stored in sequential manner. However an index is provided in terms of thumb tabs.
To search for a word we do not search sequentially. We access the index that is the appropriate
thumb tab, locate an approximate location for the word and then proceed to find the word
sequentially.

To implement the concept of indexed sequential file organisations, we consider an approach in

which the index part and data part reside on a separate file. The index file has a tree structure
and data file has a sequential structure. Since the data file is sequenced, it is not necessary for
the index to have an entry for each record following figure shows a sequential file with a
two-level index.

Level 1 of the index holds an entry for each three-record section of the main file. The level 2
indexes level 1 in the same way.

When the new records are inserted in the data file, the sequence of records need to be preserved
and also the index is accordingly updated.

Two approaches are used to implement indexes are static indexes and dynamic indexes.

As the main data file changes due to insertions and deletions, the static index contents may
change but the structure does not change. In case of dynamic indexing approach, insertions and
deletions in the main data file may lead to changes in the index structure.

Both dynamic and static indexing techniques are useful depending on the type of application.

15.7 SUMMARY
This Unit dealt with the methods of physically storing data in the files. The terms - fields,
records and files were defined. The organisation types were introduced.

The various file organisation were discussed. Sequential File Organisation finds in use in
application areas where batch processing is more common. Sequential Files are simple to use 91
Data Structures and can be stored on inexpensive media. They are suitable for applications that require direct
access to only particular records of the collection. They do not provide adequate support for
interactive applications.

In Direct file organisation there exists a predictable relationship between the key used and by
program to identify a particular record and or programmer that record’s location on secondary
storage. A direct file must be stored on a direct access device. Direct files are used extensively
in application areas where interactive processing is used.

An Indexed Sequential file supports both sequential access by key value and direct access to a
particular record given its key value. It is implemented by building an index on top of a
sequential data file that resides on a direct access storage device.

15.8 SOLUTIONS/ANSWERS

1) The following record structure could take care of the general requirements of a lending
library. Member No., Member Name, Book Classification, i.e. Book Name, Author, Issue
Date, Due Date.
2) No model answer is given.
3) (1) Sort Transaction file
No. Name
4 Alpha
6 Beta
7 Gamma
(2) Merge to get
No. Name
1 Delta
2 Lambda
4 Alpha
6 Beta
7 Gamma
8 Phi
If a sequential file on a disc is to occupy the least possible space its records must be stored
continuously i.e. with no unused space between them.
In case of addition or deletion of a record, the file must be rewritten to maintain its
sequential order without spaces.

Mitchell On Demand 5.8.2 With Crack
0% (1)
Mitchell On Demand 5.8.2 With Crack
3 pages
Veeam VMCE - v12 v2024-12-05 q138
No ratings yet
Veeam VMCE - v12 v2024-12-05 q138
59 pages
BS en 14600 2005
No ratings yet
BS en 14600 2005
30 pages
OS-Chapter 5 - File Management
100% (1)
OS-Chapter 5 - File Management
10 pages
Caie A2 Level Computer Science 9618 Theory v1
100% (1)
Caie A2 Level Computer Science 9618 Theory v1
21 pages
7UT633 Settings
No ratings yet
7UT633 Settings
7 pages
Huawei OSN Devices Overview
No ratings yet
Huawei OSN Devices Overview
2 pages
Carenado C90 - GTX - King - Air Normal Procedures
No ratings yet
Carenado C90 - GTX - King - Air Normal Procedures
17 pages
MODULE-5 FILE & Their Organization
No ratings yet
MODULE-5 FILE & Their Organization
13 pages
Unit 12 File Structures: Structure Page Nos
No ratings yet
Unit 12 File Structures: Structure Page Nos
7 pages
Student Feedback Form
No ratings yet
Student Feedback Form
54 pages
Chapter 11 File Management
No ratings yet
Chapter 11 File Management
13 pages
File Structure & Hashing Guide
No ratings yet
File Structure & Hashing Guide
12 pages
Files and Their Organization: Data Hierarchy
No ratings yet
Files and Their Organization: Data Hierarchy
17 pages
Wireless Communication Quiz
No ratings yet
Wireless Communication Quiz
10 pages
DBMS Book Special Notes PDF
No ratings yet
DBMS Book Special Notes PDF
68 pages
ICT Module 1 CSS NC-II
No ratings yet
ICT Module 1 CSS NC-II
27 pages
Fundamental File Structure Concepts & Managing Files of Records
No ratings yet
Fundamental File Structure Concepts & Managing Files of Records
18 pages
1-File Structure
No ratings yet
1-File Structure
17 pages
File Organization Midterm
No ratings yet
File Organization Midterm
43 pages
Fds Notes
No ratings yet
Fds Notes
15 pages
Programming & Algorithms Guide
No ratings yet
Programming & Algorithms Guide
122 pages
Explain File Management in An Operating System
No ratings yet
Explain File Management in An Operating System
57 pages
File Structure
No ratings yet
File Structure
18 pages
File Structure and Organization Concepts
No ratings yet
File Structure and Organization Concepts
17 pages
File Organization
No ratings yet
File Organization
17 pages
Grade 11 - File Organisation and File Access New
No ratings yet
Grade 11 - File Organisation and File Access New
2 pages
File Organization
No ratings yet
File Organization
7 pages
FP-Lecture-6 01
No ratings yet
FP-Lecture-6 01
33 pages
Data Structure Lecture 1
No ratings yet
Data Structure Lecture 1
36 pages
Lecture 3.3.3 Sequential, Relative
No ratings yet
Lecture 3.3.3 Sequential, Relative
16 pages
DSP Lab: Sampling & Aliasing
No ratings yet
DSP Lab: Sampling & Aliasing
5 pages
FJP13007 High Voltage Fast-Switching NPN Power Transistor: Features
No ratings yet
FJP13007 High Voltage Fast-Switching NPN Power Transistor: Features
6 pages
Quectel Cellular Engine: AT Commands
No ratings yet
Quectel Cellular Engine: AT Commands
14 pages
Unit 1 Introduction To Dbms
No ratings yet
Unit 1 Introduction To Dbms
27 pages
File Organization Techniques
No ratings yet
File Organization Techniques
31 pages
CSC 216 - File Organization and Data Processing
No ratings yet
CSC 216 - File Organization and Data Processing
24 pages
File Organization Techniques Guide
No ratings yet
File Organization Techniques Guide
37 pages
2022 - CMP 262 - File Organisation - Slides
No ratings yet
2022 - CMP 262 - File Organisation - Slides
19 pages
Unit 6 File Management
No ratings yet
Unit 6 File Management
70 pages
FDSUNIT4
No ratings yet
FDSUNIT4
6 pages
File Organization for SE Computer Students
No ratings yet
File Organization for SE Computer Students
66 pages
Indicator 11.7.1 Training Module Public Space
No ratings yet
Indicator 11.7.1 Training Module Public Space
39 pages
File Organization-Lec8
No ratings yet
File Organization-Lec8
31 pages
95 8396 6.3 (C7064e)
No ratings yet
95 8396 6.3 (C7064e)
12 pages
Ds Mod 5
No ratings yet
Ds Mod 5
17 pages
Data Structure Unit 5
50% (4)
Data Structure Unit 5
14 pages
Sample Template
No ratings yet
Sample Template
54 pages
File Management in Operating Systems
No ratings yet
File Management in Operating Systems
40 pages
DSA Unit6 Theory
No ratings yet
DSA Unit6 Theory
23 pages
File Organization
No ratings yet
File Organization
5 pages
A 32 Morosanu - Bioetica Si Tehnologie PDF
No ratings yet
A 32 Morosanu - Bioetica Si Tehnologie PDF
3 pages
OSY Chapter 6 SSP
No ratings yet
OSY Chapter 6 SSP
24 pages
SS2 Second Term
No ratings yet
SS2 Second Term
20 pages
Unit 6
No ratings yet
Unit 6
20 pages
DSA Practicle 11
No ratings yet
DSA Practicle 11
5 pages
1505223132foxboro - Idp10s - Brochure
No ratings yet
1505223132foxboro - Idp10s - Brochure
4 pages
Concepts of Computer Files Note
No ratings yet
Concepts of Computer Files Note
2 pages
APV Series Quick Install Guide
No ratings yet
APV Series Quick Install Guide
2 pages
File and Database Design
No ratings yet
File and Database Design
28 pages
PAC-5000A Multi-Zone Mixer Amplifier
No ratings yet
PAC-5000A Multi-Zone Mixer Amplifier
12 pages
PA - SET PA4X KBD Set List
No ratings yet
PA - SET PA4X KBD Set List
10 pages
File Organization in RDBMS
No ratings yet
File Organization in RDBMS
9 pages
CS 1550: Introduction To Operating Systems: Prof. Ahmed Amer
No ratings yet
CS 1550: Introduction To Operating Systems: Prof. Ahmed Amer
33 pages
File Organization
No ratings yet
File Organization
4 pages
Files Structrews
No ratings yet
Files Structrews
9 pages
CAIE-A2 Level-Computer Science - Theory
No ratings yet
CAIE-A2 Level-Computer Science - Theory
27 pages
Week 14 Persistent Data Storage
No ratings yet
Week 14 Persistent Data Storage
7 pages
Unit 5 Notes
No ratings yet
Unit 5 Notes
17 pages
6014 Question Paper
No ratings yet
6014 Question Paper
2 pages
WI Install 24.2.0
No ratings yet
WI Install 24.2.0
40 pages
Automotive Electrical Symbols Guide
No ratings yet
Automotive Electrical Symbols Guide
5 pages
Unit 1 Lecture 9
No ratings yet
Unit 1 Lecture 9
22 pages
TOPIC THREE-File System
No ratings yet
TOPIC THREE-File System
15 pages
7.3 Section 3 File Organisation
No ratings yet
7.3 Section 3 File Organisation
7 pages
SAP Disaster Recovery (DR) or Sandbox
No ratings yet
SAP Disaster Recovery (DR) or Sandbox
2 pages
File Organisation
No ratings yet
File Organisation
45 pages
WINSEM2024-25 CBS1003 ETH VL2024250505129 2025-04-08 Reference-Material-I
No ratings yet
WINSEM2024-25 CBS1003 ETH VL2024250505129 2025-04-08 Reference-Material-I
12 pages
Group F - 11
No ratings yet
Group F - 11
4 pages
Otc 31913 Ms
No ratings yet
Otc 31913 Ms
16 pages
Trane TR1VFD
No ratings yet
Trane TR1VFD
4 pages
Lehle-P-ISO Manual EN v1.0
No ratings yet
Lehle-P-ISO Manual EN v1.0
14 pages
Keystone School of Engineering: Group F Assignment - 12
No ratings yet
Keystone School of Engineering: Group F Assignment - 12
4 pages

Unit 15

Uploaded by

Unit 15

Uploaded by

UNIT 15 FILES

Structure Page No.

15.3 FILE ORGANISATION

We now introduce in brief the various commonly encountered file organisations.

15.4 SEQUENTIAL FILES

A sequentially organised file may be stored on either a serial-access or a direct-access storage

Following are some of the disadvantages of sequential file organisation:

• Updates are not easily accommodated

• By definition, random access is not possible

15.4.4 Areas of Use

E8) Merge the following, sequenced on NO:

15.5 DIRECT FILE ORGANISATION

Direct files are stored on DASD (Direct Access Storage Device)

15.6 INDEXED SEQUENTIAL FILE ORGANISATION

To implement the concept of indexed sequential file organisations, we consider an approach in

You might also like