Geospatial data : Raster and vector
data models
Unit-II
Course code:GI1908GE
Dr. Midhat Fayaz
Learning objectives
1. The two primary types of spatial data in GIS.
2. To recognize the nature of raster and vector data
models.
3. To understand the difference between raster and
vector data in GIS.
4. To know when should we use raster and when
should we use vector?
Spatial Data
The data or information that identifies the geographic location of features and boundaries on
earth, such as natural and constructed features like ocean, lake, pond etc.
These are in the form of graphic primitives that are usually either points, lines, polygons.
Spatial data is usually stored as coordinate and topology, and is data that can be mapped.
Non-Spatial Data
Data that relate to a specific, precisely defined location. The data are often statistical but may
be text, images or multi-media. These are linked in the GIS to spatial data that define the
location.
Non spatial data (also called attribute or characteristic data) is that information which is
independent of all geometric considerations.
e.g., persons height , mass and age are non spatial data because they are independent of the
persons location.
Spatial data types
Two types of spatial data:
Raster Data: A representation of the world as a surface
divided into a regular grid of cells. Raster models are
useful for storing data that varies continuously as in an
aerial photograph, a satellite image, a surface of chemical
concentrations, or an surface elevation
Vector Data: A representation of the world using point, lines, and
polygons. Vector models are useful for storing data that has
discrete boundaries, such as country borders, land parcels, and
Vector data
A point is a single node.
A line is two nodes with an arc between them
A polygon is a closed group of three or more arcs.
Methods of representing geographic space
Vector
Raster
The diagram below shows how real-world objects can be represented on
a computer monitor by x,y coordinates.
The coordinate pairs 1,5 3,5 5,7 8,8 and 11,7 represent a line (road)
The coordinate pairs 6,5 7,4 9,5 11,3 8,2 5,3 and 6,5 represent a polygon
(lake).
The first and last coordinates of the polygon are the same; a polygon
always closes.
● The real world around us is not uniform and smooth. It is undulating,
irregular and dynamic and to represent that geographically in a digital
mode requires the understanding of Data models.
● The models provide a way of visualizing and researching about the
world in an organised and structured way in a digital domain.
● Data models are a way of organizing any object, area and/or phenomena
that we wish to study.
● There are two essential methods used to store information in a
Geographic Information System -GIS
– Raster Model
– Vector Model
Raster Models
● Raster - The origin of the word "raster" indeed comes from the Latin word “rastrum,”
meaning "to rake," which relates to how an image is constructed line by line, much like raking
over a surface.
● Quantizes or divides space into discrete packets (cells), each representing a part of the
whole
● Cells are of equal size square, rectangular, hexagon, triangles
● Loose the ability to represent exact locations (e.g., point represented as single cell)
● Zero dimensional object rep. with 2D feature
● Lines represented as a series of connected cells
● Multiple cells joined at edges or corners, usually with only 1 or 2 neighbors, 1D objects
represented in 2D
● Areas represented as a series of connected cells
● 2D objects represented in 2D, cells distort area and shape - stairs-stepped appearance
Raster Models-continue
●Two general ways of associating attribute data with raster
entities
● 1. store an attribute for every grid cell problem is redundancy in
storage
●2. link cells to RDBMS
●Permits more than one attribute to be associated for a single
cell
●Only have to store attributes once
●Cell value linked to attribute table
●Essentially many to one - "many cells being linked to one record
Generic structure for a grid
Grid extent
Grid
cell
s
w
o
R
Resolution
Columns
● Like the vector data model, the raster data model can represent discrete point, line and
area features.
● A point feature is represented as a value in a single cell, a linear feature as a series of
Raster Data Model Advantages
● Better for storing Image data.
● A powerful format for statistical and spatial analysis.
● Easy and efficient overlaying.
● Simple Data Structure.
● Same Grid Cell for several attributes.
Raster Data Model Disadvantages
● Dataset can be large, storage space can be a problem.
● Network analysis is difficult to perform.
● Loss of information when using large cells.
● Insufficient projection transformation.
● Difficult in a representation of Topology connections.
Raster Data Model Applications
● Satellite imagery analysis.
● Elevation modeling.
● Weather forecasting.
● Environmental monitoring.
1. Spatial Resolution Characteristics of Raster Data Model
One of the most important characteristics of raster data is spatial resolution. Spatial resolution refers to the
size of the cells in the grid. The smaller the cell size, the higher the spatial resolution. High spatial resolution
data is useful for analyzing small areas or features, while low spatial resolution data is better for analyzing
larger areas.
2. Data Volume
Another characteristic of raster data is its data volume. Raster data can contain a large amount of data,
especially if the spatial resolution is high. This can make working with raster data computationally intensive,
as it requires a large amount of processing power and memory.
3. Data Type
Raster data can come in a variety of data types, including integer, float, and binary. Integer data represents
whole numbers, while float data represents decimal numbers. Binary data is used to represent presence or
absence, such as in land cover classifications.
4. Spatial Referencing
Spatial referencing is another important characteristic of raster data. Raster data must be spatially
referenced in order to be used in GIS. This means that each cell in the grid must be associated with a specific
geographic location. This allows the data to be correctly located in space and used in conjunction with other
spatial data.
5. Interpolation
Interpolation is the process of estimating values for locations where data is not available. Interpolation is
commonly used in raster data to fill in gaps or create a smoother surface. However, interpolation can
introduce errors and should be used with caution.
Characteristics of Raster Data Model
6. Resolution vs. Accuracy
It’s important to note that spatial resolution does not always equal accuracy. While high spatial
resolution data can provide more detail, it does not necessarily mean that the data is more
accurate. Accuracy is determined by the quality of the data source and the processing methods
used.
7. Data Formats
Raster data can be stored in a variety of formats, including TIFF, JPEG, and GeoTIFF. Each format
has its own advantages and disadvantages, depending on the intended use of the data.
8. Band Count
Raster data can also contain multiple bands, with each band representing a different attribute of
the geographic area. For example, a satellite image may contain multiple bands that represent
different wavelengths of light.
9. Projection
Raster data must be projected in order to be used with other spatial data. This means that the data
must be transformed from its original coordinate system to a common coordinate system that
can be used with other data.
10. Metadata
Finally, raster data should always be accompanied by metadata, which provides important
Vector Models
● Features represented in basically the same way as an analog map, permits more precise representation than
raster model, permits "empty space”, variations of the vector model
● Spaghetti models
● Simplest of vector data structures
● Does not explicitly store spatial relationships (topology), essentially X,Y coordinates, and which should be
connected by lines
● Doesn’t really "know" if points and connected lines form a line entity or poly entity
● Topological models
● Recognizes the concept of an entity
● Stores spatial relationship information explicitly associated with each entity, most common in GIS
Feature Geometry
To keep track of many features, each is assigned a unique identification number or tag.
Then, the list of coordinates for each feature is associated with the feature’s tag.
The objects you see in a vector theme are actually saved in the theme table
Vector Data Model Advantages
1.Compact data structure – Need less space for storing data.
2.Accurate Graphic output.
3.Since most information, e.g. printed version maps, is in vector form no data
conversion is required.
4.Exact geographic location of data is maintained.
5.Easily make a connection between topology and network, efficient for network
analysis.
Vector Data Model Disadvantages
6.The location of each vertex needs to be stored explicitly.
7.It has a complex Data Structure.
8.Difficult overlay operations.
9.High spatial variability is inefficiently represented.
10. Spatial analysis and filtering within polygons are impossible.
Vector Data Model Applications
11. Road network analysis.
12. Parcel mapping.
13. Administrative boundary delineation.
Characteristics of Vector Data
1. Accuracy
One of the main advantages of vector data is its accuracy. Because vector data represents objects
using mathematical points, lines, and polygons, it can be very precise. This makes it useful for
applications that require high levels of accuracy, such as surveying, engineering, and urban
planning.
2. Scalability
Another characteristic of vector data is its scalability. Vector data can be scaled up or down
without losing detail or clarity. This means that it can be used to represent both large and small
areas with equal accuracy. For example, a vector map of a city can be zoomed out to show the
entire city, or zoomed in to show individual streets and buildings.
3. Flexibility
Vector data is also very flexible. It can be easily manipulated and edited using GIS software. This
means that it can be customized to meet specific needs or requirements. For example, a GIS user
could edit a vector map to highlight certain features or add new ones.
4. Topology
Topology is another important characteristic of vector data. Topology refers to the relationships
between different features in a map. For example, the topology of a road network would include
the connections between roads, intersections, and highways. Topology is important because it
allows GIS users to perform spatial analysis, such as routing and network analysis.
5. Attribute Data
In addition to spatial data, vector data also includes attribute data. Attribute
data is non-spatial data that is associated with a particular object or feature.
For example, a vector map of a city might include attribute data such as the
population of each neighborhood or the number of businesses in each area.
Attribute data is important because it allows GIS users to perform statistical
analysis and make informed decisions.
6. Data Storage
Vector data is typically stored in a database or file format. Common file
formats for vector data include shapefile, GeoJSON, and KML. The choice of
file format depends on the specific needs of the GIS user and the
capabilities of the GIS software being used.
Differences between Raster Data Model and Vector Data Model
Vector Representation
Vector to Raster
Raster Representation
The mixed pixel problem