0% found this document useful (0 votes)

25 views18 pages

Udbms Notes

The document provides an extensive overview of NoSQL databases, detailing their history, types, advantages, and challenges compared to traditional relational databases. It highlights the emergence of NoSQL due to the demands of big data, scalability, and flexible data models, while also discussing the importance of database integration in various applications. Key topics include the characteristics of different NoSQL types, the value of relational databases, and the complexities of clustering in unstructured databases.

Uploaded by

fifthtry2121

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

25 views18 pages

Udbms Notes

Uploaded by

fifthtry2121

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 18

UDBMS

UNIT:1
TOPIC1 :

Introduction: Overview, and History of NoSQL Databases Definition of the Four Types of NoSQL
Database, The Value of Relational Databases, Getting at Persistent Data, Concurrency, Integration,
Impedance Mismatch, Application and Integration Databases, Attack of the Clusters, The Emergence
of NoSQL, Key Points

Overview, and History of NoSQL

Overview of NoSQL

NoSQL (originally referring to "non-SQL" or "non-relational") is a type of database design

that focuses on providing a mechanism for storage and retrieval of data that is modeled in
means other than the tabular relations used in relational databases. Unlike traditional
relational databases that use tables, NoSQL databases use a variety of data models, including
key-value pairs, wide-column stores, graph databases, and document stores1.

History of NoSQL

1. Early Days (1960s-1970s): Before NoSQL, data was stored in flat file systems, which
lacked standardization and made data retrieval difficult. The relational database model
was introduced by Edgar F. Codd in 1970, which standardized data storage but
struggled with handling big data2.
2. Rise of Big Data (2000s): The term "NoSQL" was coined in the early 2000s to
address the limitations of relational databases in handling large-scale data and real-
time web applications. Companies like Google, Amazon, and Facebook began
developing their own NoSQL databases to manage vast amounts of unstructured
data1.
3. Modern Era: Today, NoSQL databases are widely used in big data and real-time web
applications due to their flexibility, scalability, and performance. Popular NoSQL
databases include MongoDB, Cassandra, Redis, and Neo4j2.

Advantages of NoSQL

 Scalability: NoSQL databases are designed to scale out by distributing data across
multiple servers.
 Flexibility: They can handle unstructured and semi-structured data, making them
suitable for various types of applications.
 Performance: NoSQL databases often provide faster read/write operations compared
to traditional relational databases.
Disadvantages of NoSQL

 Lack of Standardization: Different NoSQL databases use different query languages

and data models, which can make it challenging to switch between them.
 Consistency: Many NoSQL databases prioritize availability and partition tolerance
over consistency, which can lead to eventual consistency rather than immediate
consistency.

NoSQL databases have revolutionized the way we store and manage data, especially in the
era of big data and real-time applications. Understanding their history and advantages can
help you appreciate their role in modern technology.

Topic 2:

Types of NoSQL Databases

NoSQL databases are non-relational databases designed to handle large volumes of

structured, semi-structured, and unstructured data. Unlike traditional relational databases,
NoSQL databases are more flexible and scalable, making them suitable for modern
applications. There are four primary types of NoSQL databases:

1. Document-Oriented Databases

 Description: These databases store data as documents, usually in JSON, BSON, or

XML format. Each document represents a record and contains a unique key-value
pair.
 Features:
o Schema flexibility: Each document can have its own structure.
o Ideal for hierarchical data storage.
o Enables efficient querying and indexing.
 Use Cases: Content management systems, e-commerce platforms, and real-time
analytics.
 Examples: MongoDB, CouchDB.

2. Key-Value Stores

 Description: These databases store data as a collection of key-value pairs, where each
key is unique and maps to a specific value.
 Features:
o Extremely simple data model.
o Highly performant for read and write operations.
o Excellent for caching and session management.
 Use Cases: Caching, shopping cart data, and user session data.
 Examples: Redis, Amazon DynamoDB.
3. Column-Oriented Databases

 Description: These databases store data in columns instead of rows, allowing for
efficient storage and retrieval of large datasets.
 Features:
o High scalability for handling large data volumes.
o Optimized for read-heavy operations and analytics.
o Efficient for storing sparse data.
 Use Cases: Data warehousing, business intelligence, and real-time analytics.
 Examples: Apache Cassandra, HBase.

4. Graph Databases

 Description: These databases are designed to represent and traverse relationships

between data points as nodes and edges in a graph.
 Features:
o Designed for complex, interconnected datasets.
o Efficient querying of relationships and patterns.
o Provides advanced traversal algorithms.
 Use Cases: Social networks, recommendation engines, and fraud detection.
 Examples: Neo4j, Amazon Neptune.

Topic 3:

The Value of Relational Databases:

Relational databases have been a fundamental technology in managing data for decades.
They offer several key benefits:

Data Integrity

 ACID Properties: Ensure data accuracy and reliability through Atomicity,

Consistency, Isolation, and Durability.
 Constraints (Primary key, foreign key, not null, check, unique) : Enforce rules
like foreign keys and unique constraints to maintain data integrity.

Flexibility in Queries

 SQL: A powerful language for querying and manipulating data.

 AdHoc Queries: Easily create and execute complex queries to retrieve specific data.
Standardization

 Schema Design: Clear structure and design make data easy to understand and
manage.
 Interoperability: Standard SQL language and tools enable integration with various
applications.

Transaction Management

 Reliable Transactions: Manage transactions efficiently to ensure data consistency.

 Concurrency Control: Handle multiple transactions simultaneously without
conflicts.

Security

 Access Control: Restrict data access to authorized users.

 Data Encryption: Protect data through encryption techniques.

Getting at Persistent Data

Persistent data is data that is stored in a non-volatile storage medium, ensuring it is retained
even when the system is powered off.

Durability

 Transaction Logging: Records changes to ensure data can be recovered after a

failure.
 Backup and Recovery: Regular backups and recovery procedures ensure data
persistence.

Storage Mechanisms

 Tables: Data is organized into structured tables for easy access and querying.
 Indexes: Enhance the speed of data retrieval operations.

Data Redundancy

 Replication: Copies data across multiple servers to ensure availability and

redundancy.
 RAID Configurations: Improve data protection and recovery.

Concurrency

Concurrency in databases refers to the ability to handle multiple transactions at the same
time.
Concurrency Control

 Isolation Levels: Ensure transactions are executed in a manner that they do not affect
each other.
 Locking Mechanisms: Prevent data conflicts by locking data during transactions.
 Optimistic and Pessimistic Locking: Different strategies to manage concurrent
transactions.

Integration

Integration refers to the ability of databases to work with other systems and applications.

Data Integration

 ETL Processes: Extract, Transform, Load processes to integrate data from various
sources into the database.
 APIs: Application Programming Interfaces allow different systems to interact with
the database.

Interoperability

 Standard Protocols: Use of standard protocols like ODBC(Open Database

Connectivity) and JDBC(Java Database Connectivity) for database connectivity.
 Middleware: Software that enables communication and data management between
different systems.

Impedance Mismatch

Impedance mismatch occurs when there is a disconnect between the way data is represented
in the database and how it is represented in application code.

Object-Relational Mapping (ORM)

 ORM Tools: Tools like Hibernate and Entity Framework map objects in the code to
database tables, reducing impedance mismatch.
 Advantages: Simplify data manipulation and reduce the amount of boilerplate code
needed for database operations.

Challenges

 Performance Overhead: ORM can introduce performance overhead due to

additional abstraction layers.
 Complex Mappings: Complex data models can be challenging to map accurately.
Topic 4: Application and Integration Databases

Introduction
Application and integration databases are crucial components in modern software systems
that support data storage, retrieval, and sharing across multiple applications and systems.
These databases facilitate seamless communication and data exchange, enabling the
integration of diverse applications in distributed environments.

2. Applications of Databases
Databases are used extensively across various domains for a wide range of applications:

1. E-Commerce: Online shopping platforms rely on databases to store product catalogs,

customer details, transaction history, and inventory information.
2. Healthcare: Medical records, patient history, drug databases, and diagnostic data are
stored in integrated healthcare databases.
3. Education: Educational institutions manage student records, course details, grades,
and schedules using databases.
4. Banking and Finance: Databases are used for customer account management,
transactions, fraud detection, and compliance reporting.
5. Social Media: Platforms like Facebook and Instagram use databases to store user
profiles, posts, messages, and interactions.
6. Manufacturing: Databases track inventory, supply chain logistics, production data,
and equipment maintenance schedules.

3. Integration of Databases
Database integration is the process of combining data from different databases into a unified
view to provide consistent and accurate information to users and applications. It plays a key
role in:

1. Enterprise Resource Planning (ERP): ERP systems integrate data from finance,
HR, procurement, and other modules to ensure seamless operations.
2. Customer Relationship Management (CRM): Integrating customer data from
various touchpoints (social media, digital advertisement,friends and families,
company blog, customer reviews) improves customer service and sales strategies.
3. Supply Chain Management (SCM): Integration ensures visibility across the supply
chain, from suppliers to customers. Supply chain management includes all activities
that turn raw materials into finished goods and put them into customers' hands. This
can include sourcing, design, production, warehousing, shipping, and distribution.
The goal of SCM is to improve efficiency, quality, productivity, and customer
satisfaction.
4. Business Intelligence (BI): Integrated databases provide the foundation for BI tools
to generate insights through data analytics. Exa: PowerBI, SAP BO(Business
Business Object business intelligence)
4. Techniques for Database Integration

1. Manual Integration: Data is extracted, transformed, and loaded manually, often for
small-scale or temporary projects.
2. Middleware Solutions: Middleware software acts as a bridge between applications
and databases, facilitating communication and data exchange.
3. ETL (Extract, Transform, Load): Data is extracted from multiple sources,
transformed into a uniform format, and loaded into a target database.
4. API-Based Integration: APIs allow real-time data sharing and interaction between
applications and databases.
5. Data Federation: A virtual database integrates data from multiple sources without
physically combining them, providing a unified view.

5. Challenges in Database Integration

1. Data Heterogeneity: Different databases may have incompatible formats, schemas,

and data models.
2. Scalability: Integrating large and growing datasets requires robust systems.
3. Security and Privacy: Sensitive data integration needs to comply with data
protection laws and ensure secure transmission.
4. Real-Time Integration: Ensuring data consistency and synchronization in real-time
systems is complex.
5. Cost: Implementing and maintaining integrated database systems can be expensive.

6. Benefits of Database Integration

1. Improved Data Accessibility: Users and applications can access data seamlessly
from various sources.
2. Enhanced Decision-Making: Integrated data provides a holistic view, enabling better
analysis and insights.
3. Operational Efficiency: Automating data flow between systems reduces manual
efforts and errors.
4. Scalability: Integrated databases support business growth by managing increased data
and user demands.
5. Customer Experience: Unified customer data enables personalized services and
improved interactions.

7. Examples of Integrated Database Systems

1. Google BigQuery: A serverless data warehouse that integrates with multiple data
sources for analytics.
2. Microsoft Azure Data Factory: A cloud-based integration service for data
transformation and movement.
3. Oracle Integration Cloud: Provides tools to integrate databases, applications, and
processes.
4. Apache Kafka: A distributed event streaming platform that supports real-time data
integration.

8. Future Trends in Database Integration

1. Artificial Intelligence (AI): AI is being used to automate and optimize database

integration processes.
2. Cloud Integration: Cloud-based integration platforms are becoming more prevalent
due to scalability and cost efficiency.
3. Blockchain: Secure and transparent integration of distributed data sources is
facilitated by blockchain technology.
4. Internet of Things (IoT): Integration of IoT data from various devices into central
databases is gaining importance.

Topic 5: Attack of the Clusters:

The phrase "Attack of the Clusters" in the context of unstructured databases likely refers to
the challenges and complexities that arise when dealing with clustered NoSQL databases.
Here's a brief overview:

Unstructured Databases and Clustering

Unstructured databases, such as NoSQL databases, are designed to handle unstructured data
like JSON, XML, or binary data. They are often used for big data applications, real-time web
apps, and content management systems.

Clustering in NoSQL Databases

Clustering in NoSQL databases involves distributing data across multiple servers (nodes) to
improve performance, scalability, and fault tolerance. However, managing clusters can
introduce several challenges:

1. Data Distribution

 Sharding: Splitting data into smaller chunks (shards) and distributing them across
different nodes.
 Replication: Creating copies of data on multiple nodes to ensure availability and
redundancy.
2. Consistency

 Eventual Consistency: Ensuring that all nodes eventually reach the same state, but
not necessarily immediately.
 CAP Theorem: Balancing Consistency, Availability, and Partition Tolerance in
distributed systems.

3. Scalability

 Horizontal Scaling: Adding more nodes to the cluster to handle increased data and
load.
 Vertical Scaling: Upgrading the hardware of existing nodes to improve performance.

Challenges of Clustering

 Complexity: Managing a cluster involves dealing with network communication, data

synchronization, and failure recovery.
 Security: Ensuring data security across multiple nodes and preventing unauthorized
access.
 Maintenance: Regular maintenance tasks like rebalancing data, updating software,
and monitoring performance.

Benefits of Clustering

 Improved Performance: Distributing data and workload across multiple nodes can
significantly enhance performance.
 High Availability: Replication ensures that data is still accessible even if some nodes
fail.
 Scalability: Clusters can be easily expanded to accommodate growing data and user
demands.

Topic 6: The Emergence of NoSQL:

The Emergence of NoSQL

NoSQL databases have become increasingly popular due to the limitations of traditional
relational databases when it comes to handling the vast amounts of unstructured and semi-
structured data generated by modern applications. Here's a look at the key factors that
contributed to the rise of NoSQL databases:

1. Growth of Big Data

 Volume: The sheer amount of data being generated daily by social media, IoT
devices, and other sources required more scalable solutions.
 Variety: Data types expanded beyond structured tables to include documents, graphs,
and key-value pairs.
 Velocity: The speed at which data needed to be processed and analyzed increased
significantly.
2. Scalability Challenges

 Horizontal Scaling: NoSQL databases are designed to scale out by adding more
servers, unlike relational databases that traditionally scale up by adding more power
to a single server.
 Distributed Systems: NoSQL databases leverage distributed architectures to manage
large volumes of data across multiple nodes.

3. Flexible Data Models

 Schema Flexibility: NoSQL databases do not require a fixed schema, allowing for
more agile development and easier changes to data models.
 Unstructured Data: They can handle unstructured and semi-structured data, making
them suitable for a wider range of applications.

4. High Availability and Fault Tolerance

 Replication: Data is replicated across multiple nodes to ensure high availability and
resilience to failures.
 Consistency Models: Many NoSQL databases prioritize availability and partition
tolerance (CAP Theorem) over strict consistency, offering eventual consistency
models.

5. Cloud Computing

 Elasticity: Cloud platforms provide the infrastructure to support the scalability and
distributed nature of NoSQL databases.
 Cost-Effectiveness: Pay-as-you-go models in cloud computing make it easier to
manage costs associated with scaling databases.

Popular NoSQL Databases

 MongoDB: A document-oriented database that stores data in JSON-like documents.

 Cassandra: A wide-column store known for its high availability and scalability.
 Redis: An in-memory key-value store used for caching and real-time analytics.
 Neo4j: A graph database that excels in handling complex relationships between data
points.

Use Cases

 Social Media: Managing user profiles, posts, and interactions.

 E-commerce: Storing product catalogs, customer information, and transaction data.
 Big Data Analytics: Analyzing large datasets from various sources to derive insights.
 Content Management: Handling diverse content types and metadata.
UNIT:2

Topic1: Differeneces between NoSQL and Relational

database
NoSQL Database RelaRelational Databasetional Database

NoSQL Database supports a very simple query

language.(JSON – Unstructured/Semistructured, Relational Database supports a powerful query
Cassendra query Lang - Structured, Cypher language.(SQL)
Query lang – Structured/Semi structured)

NoSQL Database has no fixed schema. When we

Relational Database has a fixed schema. in
say that a NoSQL database has no fixed schema, we
traditional relational databases, where the structure
mean that it doesn't require a predefined structure for
of the data (tables, columns, data types, etc.
the data.

NoSQL Database is only eventually consistent.

When we say a NoSQL database is "eventually
consistent," it means that after a write operation, the
system does not guarantee immediate consistency
Relational Database follows acid properties.
across all nodes. Instead, the data will become
(Atomicity, Consistency, Isolation, and Durability)
consistent over time. This concept is part of the CAP
theorem, which states that distributed databases can
only achieve two out of three characteristics:
Consistency, Availability, and Partition Tolerance.

NoSQL databases don't support transactions

(support only simple transactions). Traditional ACID
Relational Database supports transactions (also
transactions (Atomicity, Consistency, Isolation,
complex transactions with joins).
Durability) are not fully implemented in many NoSQL
systems.

NoSQL Database is used to handle data coming

in high velocity.( we're referring to data that is Relational Database is used to handle data
generated, processed, and analyzed at a very fast coming in low velocity.
rate)

The NoSQL?s data arrive from many locations..

Data in relational database arrive from one or few
User Interactions: Social media platforms, e-
locations.
commerce websites, and mobile app. Sensors and IoT
Devices: Devices like smart home appliances,
industrial sensors, and wearable technology
continuously generate data streams that need to be
processed and stored. Logs and Events: Systems and
applications produce log files and event data to
monitor activities, performance, and security.
Financial Transactions: Banks and financial
institutions handle high-velocity transaction data from
ATMs, online banking, and trading platforms.
Multimedia Content: Streaming services generate
data from video, audio, and image uploads, views, and
interactions. Geospatial Data: Location-based
services, GPS devices, and mapping applications
generate data related to geographic locations and
movements.

NoSQL database can manage structured, Relational database manages only structured
unstructured and semi-structured data. data.

Relational databases have a single point of failure

with failover. Relational databases are typically
NoSQL databases have no single point of failure. designed to operate with a single primary database
When we say that NoSQL databases have no single server that handles all read and write operations. This
point of failure, we're referring to their distributed setup can lead to a single point of failure (SPOF)
architecture, which ensures that the system remains because if the primary server fails, the entire system
operational even if some nodes (servers) fail. may become unavailable. However, relational
databases can implement failover mechanisms to
mitigate this risk.

NoSQL databases can handle big data or data in NoSQL databases are used to handle moderate
a very high volume . volume of data.

Relational database has centralized structure.

NoSQL has decentralized structure. When we say
When we say that a relational database has a
that NoSQL databases have a decentralized structure,
centralized structure, we are highlighting the
we're highlighting their distributed and scalable
traditional setup where a single primary server (or a
design, which ensures that data and operations are
small number of tightly coupled servers) handles all
spread across multiple servers or nodes
the database operations.

Relational database gives read scalability only.

NoSQL database gives both read and write
When we say that relational databases typically offer
scalability. NoSQL databases are designed to
"read scalability only," we are referring to the
efficiently handle both read and write operations as
common challenges in scaling write operations in
they scale, which is essential for applications dealing traditional relational database management systems
with large volumes of data and high traffic (RDBMS).

NoSQL database is deployed in horizontal

Relation database is deployed in vertical fashion.
fashion.

Topic:2: MongoDB use and Deployment:

MongoDB is widely used in various applications, including:

 Content Management Systems (CMS): Managing and delivering content efficiently.

 Real-time Analytics: Handling large volumes of data for real-time insights.
 Mobile and Social Media Applications: Storing user profiles, posts, and
interactions.
 E-commerce: Managing product catalogs, customer data, and transactions.
 Internet of Things (IoT): Collecting and analyzing data from IoT devices.

Deployment Options:

MongoDB can be deployed in different environments, depending on your needs:

1. Local Deployment:
o Single-Node Replica Set: Ideal for development and testing on a local
machine.
o Installation: MongoDB can be installed directly on your local machine using
the MongoDB Community Server.
2. Cloud Deployment:
o MongoDB Atlas: A fully-managed cloud service that simplifies deployment
and management.
o Deployment Types: Dedicated clusters for production, shared clusters for
development, and flex clusters for small-scale applications.
o Global Clusters: Support location-aware read and write operations for
globally distributed applications.
3. On-Premises Deployment:
o Self-Managed: Deploy MongoDB on your own infrastructure for full control
and customization.
o Configuration: Requires setting up and managing the database servers,
replication, and backups.

Topic:3: Cassandra use and deployment:

Apache Cassandra is a highly scalable and distributed NoSQL database designed for handling
large amounts of data across many commodity servers. Here's an overview of its use and
deployment:
Use Cases:

Cassandra is well-suited for applications that require:

 High Availability: Ensuring data is always accessible, even in the event of hardware
failures.
 Scalability: Handling large volumes of data and high write throughput.
 Flexible Data Modeling: Supporting various data models, including key-value,
document, and column-family.
 Real-time Data Processing: Ideal for applications that need real-time data analysis
and processing.

Deployment Options:

Cassandra can be deployed in various environments:

1. Local Deployment:
o Single-Node Setup: Ideal for development and testing on a local machine.
o Installation: Download and install Cassandra using the binary tarball, Docker
image, or package installation (RPM, YUM, APT).
2. Cloud Deployment:
o Managed Services: Cloud providers like AWS, Google Cloud, and Azure
offer managed Cassandra services, simplifying deployment and management.
o Configuration: Set up clusters, configure replication, and manage resources
through cloud provider tools.
3. On-Premises Deployment:
o Self-Managed Clusters: Deploy Cassandra on your own infrastructure for full
control and customization.
o Configuration: Set up multiple nodes, configure replication, and manage
backups and monitoring.

Topic:4: HBASE use and deployment:

Apache HBase is a distributed, scalable, and column-oriented NoSQL database that runs on
top of the Hadoop Distributed File System (HDFS). It's designed for real-time read/write
access to large datasets. Here's an overview of its use and deployment:

Use Cases:

HBase is commonly used in applications that require:

 Real-time Data Access: Applications that need fast read and write access to large
amounts of data.
 Big Data Analytics: Processing and analyzing large datasets in real-time.
 Content Management Systems: Storing and retrieving large volumes of content.
 Internet of Things (IoT): Managing data from IoT devices.
 Log and Event Data: Storing and querying log files and event data.

Deployment Options:
HBase can be deployed in various environments:

1. Local Deployment:
o Standalone Mode: Ideal for development and testing on a single machine.
o Installation: Download and install HBase using the binary tarball or package
installation (RPM, YUM, APT).
2. Cloud Deployment:
o Managed Services: Cloud providers like AWS, Google Cloud, and Azure
offer managed HBase services, simplifying deployment and management.
o Configuration: Set up clusters, configure replication, and manage resources
through cloud provider tools.
3. On-Premises Deployment:
o Self-Managed Clusters: Deploy HBase on your own infrastructure for full
control and customization.
o Configuration: Set up multiple nodes, configure replication, and manage
backups and monitoring.

Topic: 5: Neo4j use and deployment:

Neo4j is a powerful graph database that excels at managing and querying highly connected
data. Here's an overview of its use and deployment:

Use Cases:

Neo4j is ideal for applications that involve:

 Social Networks: Managing user profiles, relationships, and interactions.

 Recommendation Engines: Providing personalized recommendations based on user
behavior and connections.
 Fraud Detection: Identifying patterns and anomalies in transaction data.
 Network and IT Operations: Managing network topology, configurations, and
dependencies.
 Master Data Management: Integrating and managing data from various sources.

Deployment Options:

Neo4j can be deployed in various environments:

1. Local Deployment:
o Neo4j Desktop: A local development environment for working with Neo4j,
whether using local database instances or databases located on remote servers.
o Docker: Use the official Neo4j Docker image to set up a local database and
deploy it consistently across environments.
2. Cloud Deployment:
o Neo4j AuraDB: A fully managed cloud service that simplifies deployment
and management. It offers flexible plans for different use cases and scales
automatically.
o Cloud Marketplaces: Neo4j is available on AWS, Google Cloud, and
Microsoft Azure through their respective marketplaces.
3. On-Premises Deployment:
o Self-Managed Clusters: Deploy Neo4j on your own infrastructure for full
control and customization. This option is suitable for enterprise-grade
availability and security.
o Configuration: Set up multiple nodes, configure replication, and manage
backups and monitoring.

Topic:6 Challenges NoSQL approach:

Challenges of NoSQL Databases:

1. Lack of Standardization:
o Unlike SQL, which is standardized and widely adopted, NoSQL databases
have different query languages and data models. This can lead to a steeper
learning curve and make it harder to switch between different NoSQL
databases.
2. Data Consistency:
o NoSQL databases often prioritize availability and partition tolerance over
consistency (as per the CAP theorem), leading to eventual consistency. This
can result in temporary inconsistencies, which applications need to handle
appropriately.
3. Complexity in Data Modeling:
o NoSQL databases are highly flexible, but this flexibility can also lead to
complexity in data modeling. Designing an efficient data model that fits the
application's requirements and ensures optimal performance can be
challenging.
4. Limited ACID Transactions:
o Many NoSQL databases provide limited support for ACID (Atomicity,
Consistency, Isolation, Durability) transactions. This can make it difficult to
ensure data integrity and consistency, especially for applications that require
complex multi-document transactions.
5. Tooling and Ecosystem:
o The tooling and ecosystem around NoSQL databases are still evolving. While
they have improved significantly, they may not be as mature or extensive as
those available for traditional relational databases.
6. Data Migration:
o Migrating data from a traditional relational database to a NoSQL database can
be complex and time-consuming. This process often requires significant
planning and may involve rewriting application logic.
7. Management and Maintenance:
o Managing and maintaining a distributed NoSQL database can be more
complex than managing a centralized relational database. This includes tasks
such as configuring replication, handling node failures, and ensuring data
consistency.
8. Security:
o Ensuring the security of NoSQL databases can be challenging, as they may
lack some of the built-in security features present in relational databases.
Implementing proper access controls, encryption, and monitoring is essential.
Topic:7: Key-Value and Document Data Model:

Key-Value:

 Structure: The key-value data model stores data as pairs of keys and values. Each
key is unique and maps directly to a value.
 Simplicity: This model is simple and efficient, making it suitable for applications that
require fast read and write operations.
 Flexibility: The value can be any data type, such as strings, numbers, JSON objects,
or binary data.
 Use Cases: Caching, session management, user preferences, and real-time analytics.

Example:-

Key: user123

Value: {"name": "Alice", "age": 30, "email": "[email protected]"}

Key: product456

Value: {"name": "Laptop", "price": 999.99, "in_stock": true}

Document Data Model:

 Structure: The document data model stores data as documents, typically in JSON or
BSON format. Each document contains key-value pairs and can have nested
structures.
 Schema Flexibility: This model allows for a flexible schema, meaning documents in
the same collection can have different structures.
 Rich Queries: Document databases support rich queries and indexing, making it easy
to retrieve and manipulate complex data.
 Use Cases: Content management systems, e-commerce, mobile applications, and
social networks.

Example: JSON:

"_id": "user123",

"name": "Alice",

"age": 30,

"email": "[email protected]",
"address": {

"street": "123 Main St",

"city": "Wonderland"

"_id": "product456",

"name": "Laptop",

"price": 999.99,

"in_stock": true,

"specs": {

"processor": "Intel i7",

"ram": "16GB",

"storage": "512GB SSD"

CloudComputing DATABASE
No ratings yet
CloudComputing DATABASE
27 pages
Chapter 1 Introduction To Database Concepts
No ratings yet
Chapter 1 Introduction To Database Concepts
44 pages
Unit 2
No ratings yet
Unit 2
25 pages
DBMS PPT 1
No ratings yet
DBMS PPT 1
27 pages
Chapter1 NoSQL Databases
No ratings yet
Chapter1 NoSQL Databases
7 pages
DBMS Notes
No ratings yet
DBMS Notes
85 pages
Databases
No ratings yet
Databases
47 pages
Unit 6
No ratings yet
Unit 6
143 pages
DBMS Unit 5 Macro
No ratings yet
DBMS Unit 5 Macro
3 pages
DATABASE II, Note
No ratings yet
DATABASE II, Note
16 pages
NoSQL Databases
No ratings yet
NoSQL Databases
52 pages
Unit 5
No ratings yet
Unit 5
137 pages
BDA Module 5 - Part1 (No SQL) 2023
No ratings yet
BDA Module 5 - Part1 (No SQL) 2023
32 pages
Unit VI Big Data
No ratings yet
Unit VI Big Data
19 pages
Nosql
No ratings yet
Nosql
64 pages
Nosql Module 1
No ratings yet
Nosql Module 1
23 pages
Unit - I - Nosql
No ratings yet
Unit - I - Nosql
12 pages
Database Management Cheat Sheet
No ratings yet
Database Management Cheat Sheet
78 pages
AD Practical Theory
No ratings yet
AD Practical Theory
8 pages
NoSQL vs SQL: Key Differences Explained
No ratings yet
NoSQL vs SQL: Key Differences Explained
70 pages
Dbms 2. Rdbms 3. History 4. Types 5. Dbms Vs Rdbms
No ratings yet
Dbms 2. Rdbms 3. History 4. Types 5. Dbms Vs Rdbms
24 pages
NoSQL D
No ratings yet
NoSQL D
26 pages
Saf2 Itc
No ratings yet
Saf2 Itc
28 pages
AWS Certified Solutions Architect Associate Practice Test 03
No ratings yet
AWS Certified Solutions Architect Associate Practice Test 03
72 pages
Bda CHP 3
No ratings yet
Bda CHP 3
75 pages
No SQL
No ratings yet
No SQL
109 pages
Unit 4 BDA
No ratings yet
Unit 4 BDA
22 pages
S-Advance Database Management System 1
No ratings yet
S-Advance Database Management System 1
68 pages
01 01 Introduction
No ratings yet
01 01 Introduction
20 pages
4.2 NoSQL Databases UNIT-1
No ratings yet
4.2 NoSQL Databases UNIT-1
35 pages
DBMS Unit - III-I
No ratings yet
DBMS Unit - III-I
7 pages
Unit I
No ratings yet
Unit I
13 pages
Database Management System Abhishek
No ratings yet
Database Management System Abhishek
46 pages
009 Databases
No ratings yet
009 Databases
51 pages
Introduction To NoSQL
No ratings yet
Introduction To NoSQL
13 pages
12 Rules & Databases
No ratings yet
12 Rules & Databases
19 pages
D PWF DS 23
No ratings yet
D PWF DS 23
53 pages
Unit 2 Bda Bda
No ratings yet
Unit 2 Bda Bda
29 pages
SQL Unit1
No ratings yet
SQL Unit1
28 pages
Database
No ratings yet
Database
4 pages
Assignment 4 Rdbms
No ratings yet
Assignment 4 Rdbms
18 pages
Case Study About Database Tools
No ratings yet
Case Study About Database Tools
13 pages
102 Copies Adv Lesson 1
No ratings yet
102 Copies Adv Lesson 1
5 pages
Advanced NoSQL for MSc Students
No ratings yet
Advanced NoSQL for MSc Students
26 pages
Introduction to Relational Databases
No ratings yet
Introduction to Relational Databases
24 pages
Introduction To Database Systems
No ratings yet
Introduction To Database Systems
4 pages
Database Systems Overview
No ratings yet
Database Systems Overview
58 pages
UNIT II First Half Notes
No ratings yet
UNIT II First Half Notes
21 pages
Dbms Notes
No ratings yet
Dbms Notes
48 pages
BDA CW Chapter 3
No ratings yet
BDA CW Chapter 3
9 pages
Abdms-Unit 2 and Unit 5 Notes
No ratings yet
Abdms-Unit 2 and Unit 5 Notes
10 pages
Types o Database
No ratings yet
Types o Database
11 pages
Database Types
No ratings yet
Database Types
4 pages
2 BDA A6515 Hadoop
No ratings yet
2 BDA A6515 Hadoop
55 pages
Lecture#02 FileSystemAndDB
No ratings yet
Lecture#02 FileSystemAndDB
39 pages
App Dev Finals
No ratings yet
App Dev Finals
7 pages
Unit No 1
No ratings yet
Unit No 1
34 pages
BDA Unit 2
No ratings yet
BDA Unit 2
30 pages
Types of Databases Explained
No ratings yet
Types of Databases Explained
12 pages
Distributed File Systems: Unit - V Essay Questions
No ratings yet
Distributed File Systems: Unit - V Essay Questions
10 pages
Relational Databases vs. NoSQL
No ratings yet
Relational Databases vs. NoSQL
16 pages
UNIT-IV Advanced Architecture Part-2
No ratings yet
UNIT-IV Advanced Architecture Part-2
52 pages
Unit 4: Big Data Tehnology Landscape Two Inportant Technologies
No ratings yet
Unit 4: Big Data Tehnology Landscape Two Inportant Technologies
42 pages
AWS Complete
100% (1)
AWS Complete
64 pages
Subhajit Das Cloud Native Development
No ratings yet
Subhajit Das Cloud Native Development
3 pages
Chapter 20: Database System Architectures
No ratings yet
Chapter 20: Database System Architectures
45 pages
Unit 3 - 2 Federation Cloud
100% (1)
Unit 3 - 2 Federation Cloud
10 pages
Exam 70-462: Administering Microsoft SQL Server 2012 Databases
No ratings yet
Exam 70-462: Administering Microsoft SQL Server 2012 Databases
4 pages
Fundamental of Data Base
100% (1)
Fundamental of Data Base
40 pages
Computer Science S-75: Building Dynamic Websites
No ratings yet
Computer Science S-75: Building Dynamic Websites
22 pages
Building Scalable Applications
No ratings yet
Building Scalable Applications
20 pages
Azure Monitoring
No ratings yet
Azure Monitoring
1 page
Unit 4
No ratings yet
Unit 4
14 pages
Bulk Bundles Proposal
No ratings yet
Bulk Bundles Proposal
5 pages
Chapter One and Two
No ratings yet
Chapter One and Two
7 pages
Python Django Interview Preparation Plan
No ratings yet
Python Django Interview Preparation Plan
15 pages
UPDATED Optimizing SQL Server For Temenos T24
No ratings yet
UPDATED Optimizing SQL Server For Temenos T24
68 pages
22CS024, System Design Batch 2023
No ratings yet
22CS024, System Design Batch 2023
10 pages
Unstop Talent Park - Tech Problem Statements
No ratings yet
Unstop Talent Park - Tech Problem Statements
4 pages
GoodEats Project PRD Onyenemezu Clinton
No ratings yet
GoodEats Project PRD Onyenemezu Clinton
10 pages
Splunk and MapReduce
No ratings yet
Splunk and MapReduce
8 pages
A Cloud-Edge Computing Architecture For Monitoring Protective Equipment
No ratings yet
A Cloud-Edge Computing Architecture For Monitoring Protective Equipment
17 pages
IOTiva
No ratings yet
IOTiva
7 pages
Research Proposal Edge Computing Cloud Approach
No ratings yet
Research Proposal Edge Computing Cloud Approach
9 pages
DC - Unit I
No ratings yet
DC - Unit I
55 pages
Assignment - Da Aryan
No ratings yet
Assignment - Da Aryan
5 pages
VDO Comparisson
No ratings yet
VDO Comparisson
38 pages
Rapid Bottleneck Identification: A Better Way To Load Test
No ratings yet
Rapid Bottleneck Identification: A Better Way To Load Test
8 pages
Reducing Container Costs With Kubernetes: White Paper
No ratings yet
Reducing Container Costs With Kubernetes: White Paper
11 pages

Udbms Notes

Uploaded by

Udbms Notes

Uploaded by

UDBMS

Overview, and History of NoSQL

NoSQL (originally referring to "non-SQL" or "non-relational") is a type of database design

 Lack of Standardization: Different NoSQL databases use different query languages

Types of NoSQL Databases

NoSQL databases are non-relational databases designed to handle large volumes of

 Description: These databases store data as documents, usually in JSON, BSON, or

 Description: These databases are designed to represent and traverse relationships

The Value of Relational Databases:

 ACID Properties: Ensure data accuracy and reliability through Atomicity,

 SQL: A powerful language for querying and manipulating data.

 Reliable Transactions: Manage transactions efficiently to ensure data consistency.

 Access Control: Restrict data access to authorized users.

Getting at Persistent Data

 Transaction Logging: Records changes to ensure data can be recovered after a

 Replication: Copies data across multiple servers to ensure availability and

 Standard Protocols: Use of standard protocols like ODBC(Open Database

Object-Relational Mapping (ORM)

 Performance Overhead: ORM can introduce performance overhead due to

1. E-Commerce: Online shopping platforms rely on databases to store product catalogs,

5. Challenges in Database Integration

1. Data Heterogeneity: Different databases may have incompatible formats, schemas,

6. Benefits of Database Integration

7. Examples of Integrated Database Systems

8. Future Trends in Database Integration

1. Artificial Intelligence (AI): AI is being used to automate and optimize database

Topic 5: Attack of the Clusters:

Unstructured Databases and Clustering

Clustering in NoSQL Databases

 Complexity: Managing a cluster involves dealing with network communication, data

Topic 6: The Emergence of NoSQL:

The Emergence of NoSQL

1. Growth of Big Data

3. Flexible Data Models

4. High Availability and Fault Tolerance

Popular NoSQL Databases

 MongoDB: A document-oriented database that stores data in JSON-like documents.

 Social Media: Managing user profiles, posts, and interactions.

Topic1: Differeneces between NoSQL and Relational

NoSQL Database supports a very simple query

NoSQL Database has no fixed schema. When we

NoSQL Database is only eventually consistent.

NoSQL databases don't support transactions

NoSQL Database is used to handle data coming

The NoSQL?s data arrive from many locations..

Relational databases have a single point of failure

Relational database has centralized structure.

Relational database gives read scalability only.

NoSQL database is deployed in horizontal

Topic:2: MongoDB use and Deployment:

MongoDB is widely used in various applications, including:

 Content Management Systems (CMS): Managing and delivering content efficiently.

MongoDB can be deployed in different environments, depending on your needs:

Topic:3: Cassandra use and deployment:

Cassandra is well-suited for applications that require:

Cassandra can be deployed in various environments:

Topic:4: HBASE use and deployment:

HBase is commonly used in applications that require:

Topic: 5: Neo4j use and deployment:

Neo4j is ideal for applications that involve:

 Social Networks: Managing user profiles, relationships, and interactions.

Neo4j can be deployed in various environments:

Topic:6 Challenges NoSQL approach:

Challenges of NoSQL Databases:

Value: {"name": "Alice", "age": 30, "email": "[email protected]"}

Value: {"name": "Laptop", "price": 999.99, "in_stock": true}

Document Data Model:

"street": "123 Main St",

"processor": "Intel i7",

"storage": "512GB SSD"

You might also like