0% found this document useful (0 votes)

23 views27 pages

NO SQL-Unit 3

Uploaded by

ananthdumpa

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

23 views27 pages

NO SQL-Unit 3

Uploaded by

ananthdumpa

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 27

Unit - 3

Column-Oriented Databases
What Is a Column-Family Data Store

● A column store database is a type of

database that stores data using a column
oriented model.
● Columns store databases use a concept
called a keyspace. A keyspace is kind of
like a schema in the relational model. The
keyspace contains all the column families
(kind of like tables in the relational
model), which contain rows, which
contain columns.
Ex:
Here’s a breakdown of each element in the row:

● Row Key. Each row has a unique key, which is a unique identifier for that row.
● Column. Each column contains a name, a value, and timestamp.
● Name. This is the name of the name/value pair.
● Value. This is the value of the name/value pair.
● Timestamp. This provides the date and time that the data was inserted. This can
be used to determine the most recent version of data.
Benefits
● Compression. Column stores are very efficient at data compression and/or
partitioning.
● Aggregation queries. Due to their structure, columnar databases perform particularly
well with aggregation queries (such as SUM, COUNT, AVG, etc).
● Scalability. Columnar databases are very scalable. They are well suited to massively
parallel processing (MPP), which involves having data spread across a large cluster
of machines – often thousands of machines.
● Fast to load and query. Columnar stores can be loaded extremely fast. A billion row
table could be loaded within a few seconds. You can start querying and analysing
almost immediately.
Examples of Column Store DBMSs

● Bigtable
● Cassandra
● HBase
● Vertica
● Druid
● Accumulo
● Hypertable
● Apache Cassandra is an open source, distributed and decentralized/distributed storage system
(database), for managing very large amounts of structured data spread out across the world. It
provides highly available service with no single point of failure.
● Notable points
○ Apache Cassandra was initially designed at Facebook to implement a combination of
Amazon’s Dynamo distributed storage and replication techniques and Google’s Bigtable data
and storage engine model.
○ It was open-sourced by Facebook in July 2008.
○ Cassandra was accepted into Apache Incubator in March 2009.
○ It is a column-oriented NoSQL database
○ Scalable, fault-tolerant and consistent
○ Cassandra implements a Dynamo-style replication model with no single point of failure, but
adds a more powerful “column family” data model.
○ Cassandra is being used by some of the biggest companies such as Facebook, Twitter, Cisco,
Rackspace, ebay, Twitter, Netﬂix, and more.
Cassandra Architecture
● Basic Terminology:
○ Node
○ Data center
○ Cluster
● Operations:
○ Read Operation
○ Write Operation
● Storage Engine:
○ CommitLog
○ Memtables
○ SSTables
● Data Replication Strategies
Node
Node:
Node is the basic
component in Apache
Cassandra. It is the place
where actually data is
stored. For Example:As
shown in diagram node
which has IP address
10.0.0.7 contain data
(keyspace which contain
one or more tables).
DataCenter
Data Center is a collection of nodes.

For example:

DC – N1 + N2 + N3 ….

DC: Data Center

N1: Node 1

N2: Node 2

N3: Node 3
Cluster
It is the collection of many data centers.
For example:
C = DC1 + DC2 + DC3….
C: Cluster
DC1: Data Center 1
DC2: Data Center 2
DC3: Data Center 3
Read Operation

● Direct Request
○ Send request to one of the
nodes
● Digest Request
○ Send request to N nodes as
specified in the
CONSISTENCY parameter
● Read - Repair Request
○ Will be triggered in case of
INCONSISTENT nodes to
make them consistent
Write Operation
● Step 1 : Write Operation as soon as we receives request then it is first
dumped into commit log to make sure that data is saved.
● Step-2: Insertion of data into table that is also written in MemTable that holds
the data till it’s get full.
● Step-3: If MemTable reaches its threshold then data is flushed to SS Table.
Replication Strategies
● It is used to ensure that there are no point of failures,
Each data item is replicated at N hosts, where N is the
replication factor configured \per-instance
● There are 2 types of replication strategies
○ Simple Strategy
○ Network Topology STrategy
Simple Strategy
In this Strategy it allows a single
integer RF (replication_factor) to
be defined. It determines the
number of nodes that should
contain a copy of each row. For
example, if replication_factor is
2, then two different nodes
should store a copy of each row.
○ Network Topology STrategy

In this strategy it allows a replication

factor to be specified for each
datacenter in the cluster. Even if your
cluster only uses a single datacenter.
This Strategy should be preferred
over SimpleStrategy to make it easier
to add new physical or virtual
datacenters to the cluster later.
Cassandra Data types
Create KeySpace

CREATE KEYSPACE Demo

WITH replication = {'class':'SimpleStrategy',
'replication_factor' : 2};
Alter KeySpace

ALTER KEYSPACE Demo

WITH replication = {'class':'SimpleStrategy',
'replication_factor' : 3};
Drop KeySpace

DROP KEYSPACE Demo;

CRUD operations on tables

Create:
Create table personal details(sid int, sname
text, spg boolean, emails set<text>);
CRUD operations on tables

Read
Select * from personal;
CRUD operations on tables

Update
Update personal set spg=false where
sid=123
CRUD operations on tables

Delete
Delete from personal where sid=143;
Suitable use cases
● Event Logging

● Content Management Systems, Blogging Platforms

● Counters
● Expiring usage
When not to use

● Databases which requires ACID properties for reads and

writes
● Early prototypes which requires query change, as the cost
of query change is more when compared to schema
change.

NoSQL Apache Cassandra
No ratings yet
NoSQL Apache Cassandra
159 pages
NoSql Unit 2
No ratings yet
NoSql Unit 2
72 pages
Cassandra for Developers
100% (2)
Cassandra for Developers
183 pages
Key - Value - Database - (2) (1) (Read-Only)
No ratings yet
Key - Value - Database - (2) (1) (Read-Only)
48 pages
(D862.Ebook) PDF Download Principles of Textile Testing by Je Booth
50% (2)
(D862.Ebook) PDF Download Principles of Textile Testing by Je Booth
4 pages
Cassandra
No ratings yet
Cassandra
25 pages
9 TH
No ratings yet
9 TH
33 pages
Big Data - No SQL Databases and Related Concepts
100% (1)
Big Data - No SQL Databases and Related Concepts
101 pages
Cassandra
No ratings yet
Cassandra
31 pages
NoSQL for Tech Professionals
No ratings yet
NoSQL for Tech Professionals
29 pages
Column Oriented Database
No ratings yet
Column Oriented Database
45 pages
Cambridge Computer Science For IGCSE Cambridge Course Book 2022 Pages 1
No ratings yet
Cambridge Computer Science For IGCSE Cambridge Course Book 2022 Pages 1
17 pages
NoSql 2024 Assign2
No ratings yet
NoSql 2024 Assign2
189 pages
SS1123 - D2T - Apache Cassandra Overview PDF
100% (1)
SS1123 - D2T - Apache Cassandra Overview PDF
45 pages
DMND Module 2
No ratings yet
DMND Module 2
21 pages
App Ache
No ratings yet
App Ache
55 pages
Nosql 1
No ratings yet
Nosql 1
40 pages
Intro To NoSQL
No ratings yet
Intro To NoSQL
18 pages
Lect26 After
No ratings yet
Lect26 After
28 pages
Lec 10 - Column DB
No ratings yet
Lec 10 - Column DB
34 pages
2: Data Model: Creating An E Cient Data Model For Highly-Loaded Applications
No ratings yet
2: Data Model: Creating An E Cient Data Model For Highly-Loaded Applications
83 pages
Cloud Data Storage
No ratings yet
Cloud Data Storage
47 pages
Lecture 1
No ratings yet
Lecture 1
31 pages
Bcse302l Dbms Module-7 Nosql
No ratings yet
Bcse302l Dbms Module-7 Nosql
30 pages
Cassandra Data Model
No ratings yet
Cassandra Data Model
17 pages
Cassendra
100% (1)
Cassendra
21 pages
Cassandra Database Overview
No ratings yet
Cassandra Database Overview
37 pages
Lecture 6 - NoSQL
No ratings yet
Lecture 6 - NoSQL
28 pages
Unit 2
No ratings yet
Unit 2
26 pages
Nosql: John Paul Ashenfelter CTO/Transitionpoint
No ratings yet
Nosql: John Paul Ashenfelter CTO/Transitionpoint
35 pages
No SQL
No ratings yet
No SQL
32 pages
Installing Ubuntu Server
100% (1)
Installing Ubuntu Server
13 pages
OD 03 PDE Building and Operationalizing Data Processing Systems
No ratings yet
OD 03 PDE Building and Operationalizing Data Processing Systems
34 pages
Introduction To Nosql: Gabriele Pozzani
No ratings yet
Introduction To Nosql: Gabriele Pozzani
49 pages
BIG Data 2
No ratings yet
BIG Data 2
18 pages
Introduction To NOSQL and Cassandra: @rantav @outbrain
No ratings yet
Introduction To NOSQL and Cassandra: @rantav @outbrain
60 pages
04 Introduction To CassandraDB
No ratings yet
04 Introduction To CassandraDB
19 pages
GASTAT-700 Interface Protcol V1.06 - 180115
No ratings yet
GASTAT-700 Interface Protcol V1.06 - 180115
21 pages
Apache Cassandra Nosql SonuJha 04
No ratings yet
Apache Cassandra Nosql SonuJha 04
14 pages
Facebook Cassandra
No ratings yet
Facebook Cassandra
10 pages
Ccomputing Madurya
No ratings yet
Ccomputing Madurya
20 pages
DBMS 11
No ratings yet
DBMS 11
13 pages
Unit-3 (Iot)
No ratings yet
Unit-3 (Iot)
13 pages
Cassandra Complete Notes
No ratings yet
Cassandra Complete Notes
5 pages
Chapter 10
No ratings yet
Chapter 10
25 pages
Wide-Column Stores: Big Data Management Phil Bartie
No ratings yet
Wide-Column Stores: Big Data Management Phil Bartie
46 pages
cp5293 Big Data Analytics Unit 5 PDF
No ratings yet
cp5293 Big Data Analytics Unit 5 PDF
28 pages
Cassandra: Decentralized Storage System
No ratings yet
Cassandra: Decentralized Storage System
37 pages
Features of Cassandra
No ratings yet
Features of Cassandra
6 pages
NOSQL Data Stores Overview
No ratings yet
NOSQL Data Stores Overview
48 pages
Apache Cassandra: by Chethan Gowda
No ratings yet
Apache Cassandra: by Chethan Gowda
12 pages
Cassandra for Database Developers
No ratings yet
Cassandra for Database Developers
15 pages
Unit 4
No ratings yet
Unit 4
7 pages
An Overview of Apache Cassandra: Cassandra Essentials Tutorial Series
No ratings yet
An Overview of Apache Cassandra: Cassandra Essentials Tutorial Series
20 pages
Dzone Refcard 153 Apache Cassandra 2020
No ratings yet
Dzone Refcard 153 Apache Cassandra 2020
11 pages
Apache Cassandra: Het Patel Kajal Patel
No ratings yet
Apache Cassandra: Het Patel Kajal Patel
8 pages
NoSQL vs. Cloud Data Storage Systems
No ratings yet
NoSQL vs. Cloud Data Storage Systems
17 pages
Name Shivam Prasad Reg No. 15BCE1196
No ratings yet
Name Shivam Prasad Reg No. 15BCE1196
8 pages
KAUST Update - November 2022
No ratings yet
KAUST Update - November 2022
26 pages
Cassandra
No ratings yet
Cassandra
6 pages
LaTeX Homework Help Service
100% (1)
LaTeX Homework Help Service
6 pages
Revised Syllabus TY Information Technology W.e.f.ay 2020 21
No ratings yet
Revised Syllabus TY Information Technology W.e.f.ay 2020 21
28 pages
CompTIA A 2009 in Depth 3rd Edition Jean (Jean Andrews) Andrews - Get Instant Access To The Full Ebook Content
100% (1)
CompTIA A 2009 in Depth 3rd Edition Jean (Jean Andrews) Andrews - Get Instant Access To The Full Ebook Content
43 pages
Seminar Topic Nosql
No ratings yet
Seminar Topic Nosql
73 pages
OOPJ Unit 1 Material
No ratings yet
OOPJ Unit 1 Material
37 pages
Cheat Sheet v2
No ratings yet
Cheat Sheet v2
3 pages
Approach 2 - Middleware - SAP ECC or S4HANA BTP
No ratings yet
Approach 2 - Middleware - SAP ECC or S4HANA BTP
20 pages
Cassandra Quick Guide
No ratings yet
Cassandra Quick Guide
60 pages
2013HW70753-EndSemReport-Sagar Agrawal
No ratings yet
2013HW70753-EndSemReport-Sagar Agrawal
56 pages
NoSQL vs Relational Databases Guide
No ratings yet
NoSQL vs Relational Databases Guide
29 pages
NoSQL - Unit2
No ratings yet
NoSQL - Unit2
8 pages
DBMS Lab Syllabus
No ratings yet
DBMS Lab Syllabus
2 pages
What Is A Computer
No ratings yet
What Is A Computer
6 pages
Sayali 2
No ratings yet
Sayali 2
49 pages
Through A Gender Lens: An Empirical Study of Emoji Usage Over Large-Scale Android Users
No ratings yet
Through A Gender Lens: An Empirical Study of Emoji Usage Over Large-Scale Android Users
20 pages
Versa Training Lab Guide: Groups 1 - 2
No ratings yet
Versa Training Lab Guide: Groups 1 - 2
20 pages
DBMS Lab
No ratings yet
DBMS Lab
2 pages
B E Mechatronics
No ratings yet
B E Mechatronics
46 pages
R Studio Notes
No ratings yet
R Studio Notes
10 pages
CSC2071 - Lecture 08 (Classes)
No ratings yet
CSC2071 - Lecture 08 (Classes)
29 pages
Cassandra
No ratings yet
Cassandra
7 pages
RM Plagarism Report
No ratings yet
RM Plagarism Report
10 pages
Resuume
No ratings yet
Resuume
2 pages
SVMBasedRealTimeHand WrittenDigitRecognitionSystem
No ratings yet
SVMBasedRealTimeHand WrittenDigitRecognitionSystem
7 pages
Industrial Temperature Transmitter Guide
No ratings yet
Industrial Temperature Transmitter Guide
3 pages
Monitoring Plant Health Andd Detection of Plant Disease Using Iot
No ratings yet
Monitoring Plant Health Andd Detection of Plant Disease Using Iot
15 pages
Design and Analysis of Algorithms: Course Code No: 20CS5T01
No ratings yet
Design and Analysis of Algorithms: Course Code No: 20CS5T01
3 pages
SG110CX: Multi-MPPT String Inverter For System
No ratings yet
SG110CX: Multi-MPPT String Inverter For System
2 pages
Ect303 Digital Signal Processing, December 2022
No ratings yet
Ect303 Digital Signal Processing, December 2022
3 pages
Going Beyond T-SNE: Exposing Whatlies in Text Embeddings
No ratings yet
Going Beyond T-SNE: Exposing Whatlies in Text Embeddings
8 pages
IPDevice Integration Patch 20200622
No ratings yet
IPDevice Integration Patch 20200622
6 pages
How To Create A Responsive Navigation Menu With Icons
No ratings yet
How To Create A Responsive Navigation Menu With Icons
4 pages
PS.2024.C3.Corte1.Pruebas de Integracion.223204.GallegosBorraz
No ratings yet
PS.2024.C3.Corte1.Pruebas de Integracion.223204.GallegosBorraz
6 pages
Anurag Resume
No ratings yet
Anurag Resume
3 pages
CSE - 4052610-CHS Important-1
No ratings yet
CSE - 4052610-CHS Important-1
2 pages

NO SQL-Unit 3

Uploaded by

NO SQL-Unit 3

Uploaded by

Unit - 3

● A column store database is a type of

DC: Data Center

In this strategy it allows a replication

CREATE KEYSPACE Demo

ALTER KEYSPACE Demo

DROP KEYSPACE Demo;

● Content Management Systems, Blogging Platforms

● Databases which requires ACID properties for reads and

You might also like