0% found this document useful (0 votes)

25 views61 pages

Intro To Cassandra For Developers

Uploaded by

Adithya ghost

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

25 views61 pages

Intro To Cassandra For Developers

Uploaded by

Adithya ghost

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 61

Intro to Cassandra for Developers

Housekeeping
Courses: youtube.com/DataStaxDevs Runtime: dtsx.io/workshop

YouTube

Twitch

Questions: bit.ly/cassandra-workshop Quizz: menti.com

Discord

YouTube

2
Achievement Unlocked! - “Introduction to Cassandra”
Homework
==
Fully managed Cassandra
Without the ops!
DataStax Astra

Global Scale No Operations 25 Gig Free Tier

Put your data where you need it Launch a database in the cloud
Eliminate the overhead to install,
without compromising performance, with a few clicks, no credit card
operate, and scale Cassandra.
availability, or accessibility. required.
menti.com
Apache Cassandra™ = NoSQL Distributed Database

1 Installation = 1 NODE
NODE ✔ Capacity = ~ 2-4TB
✔ Throughput = LOTS Tx/sec/core
NODE NODE

DataCenter | Ring

NODE NODE
Communication:
✔ Gossiping

NODE NODE
Apache Cassandra™ = NoSQL Distributed Database

- Big Data Ready

- Highest Availability
- Geographical Distribution
- Read/Write Performance
- Vendor Independent
Data is Distributed
Country City Population

USA New York 8.000.000

USA Los Angeles 4.000.000
FR Paris 2.230.000
DE Berlin 3.350.000
UK London 9.200.000
AU Sydney 4.900.000
DE Nuremberg 500.000
CA Toronto 6.200.000
CA Montreal 4.200.000
FR Toulouse 1.100.000
JP Tokyo 37.430.000
IN Mumbai 20.200.000

Partition Key
Data is Distributed
USA New York 8.000.000
Country City Population
USA Los Angeles 4.000.000

FR Paris 2.230.000
DE Berlin 3.350.000
FR Toulouse 1.100.000
DE Nuremberg 500.000

UK London 9.200.000 JP Tokyo 37.430.000

AU Sydney 4.900.000 CA Toronto 6.200.000

IN Mumbai 20.200.000 CA Montreal 4.200.000
Data is Replicated

RF = 3 83 17

Replication Factor 3
means that every
row is stored on 3
different nodes
67 33

50
Replication within the Ring

0
59 (data)
83 17

RF = 3

67 33

50
Replication within the Ring

83 59 (data)
17

RF = 3

67 33

50
Replication within the Ring

59 (data)
0

59 (data)
83 17

RF = 3

59 (data)
67 33

50
Node Failure

59 (data)
0

83 17 Hint
59 (data)
RF = 3

59 (data)
67 33

50
Node Failure Recovered

59 (data)
0

83 17 Hint
59 (data)
RF = 3

59 (data)
67 33

50
Immediate Consistency – A Better Way

Client Client

Write Read
CL = QUORUM CL = QUORUM
Data Distributed Everywhere

• Geographic Distribution • Hybrid-Cloud and Multi-Cloud

On-premise
Understanding Use Cases
High Throughput Heavy Writes Event Streaming Log Analytics
Scalability
High Volume Heavy Reads Internet of Things Other Time Series

No Data Loss Caching Pricing

Availability Mission-Critical
Always-on Market Data Inventory

Global Presence Banking Retail

Distributed Compliance /
GDPR Tracking / Customer
Workload Mobility
Logistics Experience

Modern Cloud API Layer Hybrid-cloud

Cloud-native Applications
Enterprise Data
Multi-cloud
Layer
https://github.com/DataStax-Academy
/Intro-to-Cassandra-for-Developers
Intro to Cassandra for Developers

1. Tables, Partitions

2. The Art of Data Modelling

3. What’s NEXT?
Intro to Cassandra for Developers

1. Tables, Partitions

2. The Art of Data Modelling

3. What’s NEXT?
Data Structure: a Cell

An intersection of a row
and a column, stores data.
Data Structure: a Row

A single, structured
data item in a table.
Data Structure: a Partition

A group of rows having the ID First Name Last Name Department

same partition token, a base
unit of access in Cassandra. 1 John Doe Wizardry

IMPORTANT: stored together, all 399 Marisha Chapez Wizardry

the rows are guaranteed to be
neighbors. 415 Maximus Flavius Wizardry
Data Structure: a Table

ID First Name Last Name Department

1 John Doe Wizardry

A group of columns and
rows storing partitions. 2 Mary Smith Dark Magic

3 Patrick McFadin DevRel

Data Structure: Overall
Keyspace columns

Table ● Tabular data model, with one twist

● Tables are organized in rows and columns
- - - -
- - - ● Groups of related rows called partitions are
x stored together on the same node (or nodes)
partitions - - -
● Each row contains a partition key
- - - ○ One or more columns that are hashed to
y - - - determine which node(s) store that data
- - -

z - - -
rows

Partition key
Example Data: Users organized by city

Keyspace killrvideo

Table users_by_city
Last First
City Address Email
Name Name
Hellson Kevin 23 Jackson St. [email protected]
Phoenix Lastfall Norda 3 Stone St [email protected]
partitions Smith Jana 3 Stone St [email protected]
Franklin George 2 Star St [email protected]
rows
Seattle Jackson Jane 2 Star St [email protected]
Jasons Judy 2 StarSt [email protected]

Partition key column Clustering columns Data columns

Creating a Table in CQL

keyspace table

CREATE TABLE killrvideo.users_by_city (

city text,
column last_name text,
deﬁnitions first_name text,
address text,
email text,
PRIMARY KEY ((city), last_name, first_name, email));

Primary key Partition key Clustering columns

Primary Key CREATE TABLE killrvideo.users_by_city (
city text,
An identiﬁer for a row. Consists last_name text,
of at least one Partition Key and first_name text,
address text,
zero or more Clustering email text,
Columns. PRIMARY KEY ((city), last_name, first_name, email));

MUST ENSURE UNIQUENESS.

MAY DEFINE SORTING. Partition key Clustering columns

Good Examples:

PRIMARY KEY ((city), last_name, first_name, email);

PRIMARY KEY (user_id);

Bad Example:
PRIMARY KEY ((city), last_name, first_name);
Partition Key CREATE TABLE killrvideo.users_by_city (
city text,
An identiﬁer for a partition. last_name text,
Consists of at least one column, first_name text,
address text,
may have more if needed email text,
PRIMARY KEY ((city), last_name, first_name, email));
PARTITIONS ROWS.

Partition key Clustering columns

Good Examples:

PRIMARY KEY (user_id);

PRIMARY KEY ((video_id), comment_id);

Bad Example:
PRIMARY KEY ((sensor_id), logged_at);
Clustering Column(s) CREATE TABLE killrvideo.users_by_city (
city text,
Used to ensure uniqueness and last_name text,
sorting order. Optional. first_name text,
address text,
email text,
PRIMARY KEY ((city), last_name, first_name, email));

Partition key Clustering columns

PRIMARY KEY ((city), last_name, first_name); Not Unique

PRIMARY KEY ((city), last_name, first_name, email);

PRIMARY KEY ((video_id), comment_id); Not Sorted

PRIMARY KEY ((video_id), created_at, comment_id);

The Slide of the Year Award!
Rules of a Good Partition
● Store together what you retrieve together
● Avoid big partitions
● Avoid hot partitions

Example: open a video? Get the comments in a single query!

PRIMARY KEY ((video_id), created_at, comment_id);

PRIMARY KEY ((comment_id), created_at);

The Slide of the Year Award!
Rules of a Good Partition
● Store together what you retrieve together
● Avoid big partitions
● Avoid hot partitions

PRIMARY KEY ((video_id), created_at, comment_id);

PRIMARY KEY ((country), user_id);

● Up to 2 billion cells per partition

● Up to ~100k rows in a partition
● Up to ~100MB in a Partition
The Slide of the Year Award!
Rules of a Good Partition
● Store together what you retrieve together
● Avoid big and constantly growing partitions!
● Avoid hot partitions

Example: a huge IoT infrastructure, hardware all over

● Sensor ID: UUID
the world, different sensors reporting their state
● Timestamp: Timestamp
every 10 seconds. Every sensor reports its UUID,
● Value: ﬂoat
timestamp of the report, sensor’s value.

PRIMARY KEY ((sensor_id), reported_at);

The Slide of the Year Award!
Rules of a Good Partition
● Store together what you retrieve together

BUCKETING
● Avoid big and constantly growing partitions!
● Avoid hot partitions

Example: a huge IoT infrastructure, hardware all over

● Sensor ID: UUID
the world, different sensors reporting their state
● MonthYear: Integer or String
every 10 seconds. Every sensor reports its UUID,
● Timestamp: Timestamp
timestamp of the report, sensor’s value.
● Value: ﬂoat

PRIMARY KEY ((sensor_id), reported_at);

PRIMARY KEY ((sensor_id, month_year), reported_at);

The Slide of the Year Award!
Rules of a Good Partition
● Store together what you retrieve together
● Avoid big partitions
● Avoid hot partitions

PRIMARY KEY (user_id);

PRIMARY KEY ((video_id), created_at, comment_id);

PRIMARY KEY ((country), user_id);

https://github.com/DataStax-Academy/Intro-t
o-Cassandra-for-Developers#2-create-a-table
Intro to Cassandra for Developers

1. Tables, Partitions

2. The Art of Data Modelling

3. What’s NEXT?
Normalization
Employees
“Database normalization is the process of
structuring a relational database in accordance userId deptId ﬁrstName lastName
with a series of so-called normal forms in order
to reduce data redundancy and improve data 1 1 Edgar Codd
integrity. It was ﬁrst proposed by Edgar F. Codd
as part of his relational model.” 2 1 Raymond Boyce

Departments

departmentId department
PROS: Simple write, Data Integrity
CONS: Slow read, Complex Queries 1 Engineering

2 Math

41
Denormalization
“Denormalization is a strategy used on a Employees
database to increase performance. In
computing, denormalization is the process of userId ﬁrstName lastName department
trying to improve the read performance of a
database, at the expense of losing some write 1 Edgar Codd Engineering
performance, by adding redundant copies of
data” 2 Raymond Boyce Engineering

3 Sage Lahja Math

PROS: Quick Read, Simple Queries 4 Juniper Jones Botany

CONS: Multiple Writes, Manual Integrity

42
Relational Data Modelling
Data
1. Analyze raw data

2. Identify entities, their properties

and relations

3. Design tables, using

normalization and foreign keys. Models

4. Use JOIN when doing queries to

join normalized data from
multiple tables

Application
NoSQL Data Modelling
Application
1. Analyze user behaviour
(customer ﬁrst!)

2. Identify workﬂows, their

dependencies and needs

3. Deﬁne Queries to fulﬁll these Models

workﬂows

4. Knowing the queries, design tables,

using denormalization.

5. Use BATCH when inserting or

updating denormalized data of Data
multiple tables
Designing Process: Step by Step
Entities & Relationships

Queries
Designing Process:
Conceptual Data Model
Designing Process:
Application Workﬂow

Use-Case I:
● A User opens a Proﬁle

WF2: Find comments related to target user using its identiﬁer, get most recent ﬁrst

Use-Case II:
● A User opens a Video Page

WF1: Find comments related to target video using its identiﬁer, most recent ﬁrst
Designing Process:
Mapping

Query I: Find comments posted for a user comments_by_user

with a known id (show most recent ﬁrst)

Query II: Find comments for a video with a comments_by_video

known id (show most recent ﬁrst)
Designing Process:
Mapping

SELECT * FROM comments_by_user comments_by_user

WHERE userid = <some UUID>

SELECT * FROM comments_by_video comments_by_video

WHERE videoid = <some UUID>
Designing Process:
Logical Data Model

comments_by_user comments_by_video

userid K videoid K
creationdate creationdate C
↑
C
↑
commentid C↑ commentid C↑
videoid userid
comment comment
Designing Process:
Physical Data Model

comments_by_user comments_by_video

userid UUID K videoid UUID K

commentid TIMEUUID C
↑ commentid TIMEUUID C
↑
videoid UUID userid UUID

comment TEXT comment TEXT

Designing Process:
Schema DDL
CREATE TABLE IF NOT EXISTS comments_by_user (
userid uuid,
commentid timeuuid,
videoid uuid,
comment text,
PRIMARY KEY ((userid), commentid)
) WITH CLUSTERING ORDER BY (commentid DESC);

CREATE TABLE IF NOT EXISTS comments_by_video (

videoid uuid,
commentid timeuuid,
userid uuid,
comment text,
PRIMARY KEY ((videoid), commentid)
) WITH CLUSTERING ORDER BY (commentid DESC);
https://github.com/DataStax-Academy/Intro-to-Cas
sandra-for-Developers#3-execute-crud-operations
menti.com
Intro to Cassandra for Developers

1. Tables, Partitions

2. The Art of Data Modelling

3. What’s NEXT?
Homework
MORE LEARNING!!!!
Developer site: datastax.com/dev

● Developer Stories
● New hands-on learning scenarios with
Katacoda
● Try it Out
● Cassandra Fundamentals
● https://www.datastax.com/learn/cassandra-funda
mentals
● New Data Modeling course
https://www.datastax.com/dev/modeling

Classic courses available at DataStax Academy

✔ Academy.datastax.com

✔ datastax.com/dev

✔ community.datastax.com

✔ Datastax Developers
YouTube Channel

58
Weekly Workshops https://www.datastax.com/workshops

59
Join our 10k Discord Community https://bit.ly/cassandra-workshop
The Fellowship of the RINGS

60
Thank you!

Cassandra As Used by Facebook
100% (1)
Cassandra As Used by Facebook
12 pages
Cassandra Presentation Final
100% (3)
Cassandra Presentation Final
71 pages
FortiAuthenticator Student Guide-Online
67% (3)
FortiAuthenticator Student Guide-Online
455 pages
Business Analytics Concepts and Frameworks-Module1
No ratings yet
Business Analytics Concepts and Frameworks-Module1
5 pages
Auditing Internal Control Over Financial Reporting - Chapter 7
100% (3)
Auditing Internal Control Over Financial Reporting - Chapter 7
25 pages
Database Management System - DBMS (COMPUTER SCIENCE) Video Lecture For GATE Preparation (CS IT MCA)
No ratings yet
Database Management System - DBMS (COMPUTER SCIENCE) Video Lecture For GATE Preparation (CS IT MCA)
3 pages
Normalization 1st To 5th NF With Example
No ratings yet
Normalization 1st To 5th NF With Example
33 pages
Cassandra Quick Guide
No ratings yet
Cassandra Quick Guide
60 pages
Introduction To NOSQL and Cassandra: @rantav @outbrain
No ratings yet
Introduction To NOSQL and Cassandra: @rantav @outbrain
60 pages
Cassandra
No ratings yet
Cassandra
7 pages
Adobe Connect Installation and Configuration Guide
No ratings yet
Adobe Connect Installation and Configuration Guide
64 pages
Cassandra PPT Final
No ratings yet
Cassandra PPT Final
23 pages
System Analysis and Design Process Modelling
No ratings yet
System Analysis and Design Process Modelling
57 pages
Chapter 7
No ratings yet
Chapter 7
48 pages
Institute of Accountancy Arusha (IAA)
100% (1)
Institute of Accountancy Arusha (IAA)
23 pages
Learn Cassandra
100% (2)
Learn Cassandra
37 pages
Deep Dive With Cassandra
No ratings yet
Deep Dive With Cassandra
29 pages
Cassandra
No ratings yet
Cassandra
25 pages
Key - Value - Database - (2) (1) (Read-Only)
No ratings yet
Key - Value - Database - (2) (1) (Read-Only)
48 pages
2: Data Model: Creating An E Cient Data Model For Highly-Loaded Applications
No ratings yet
2: Data Model: Creating An E Cient Data Model For Highly-Loaded Applications
83 pages
Instaclustr by NetApp 10 Rules For Managing Cassandra White Paper 3 20mar24
No ratings yet
Instaclustr by NetApp 10 Rules For Managing Cassandra White Paper 3 20mar24
11 pages
COmp INtfc Code
No ratings yet
COmp INtfc Code
21 pages
Cassandra Complete Notes
No ratings yet
Cassandra Complete Notes
5 pages
Bi 2
No ratings yet
Bi 2
8 pages
Cassandra Data Model Big Data Seminar
No ratings yet
Cassandra Data Model Big Data Seminar
8 pages
Wide-Column Stores: Big Data Management Phil Bartie
No ratings yet
Wide-Column Stores: Big Data Management Phil Bartie
46 pages
Class 3 Cassandra
No ratings yet
Class 3 Cassandra
64 pages
Learning Apache Cassandra - Sample Chapter
No ratings yet
Learning Apache Cassandra - Sample Chapter
20 pages
Cassandra: Decentralized Storage System
No ratings yet
Cassandra: Decentralized Storage System
37 pages
Cassandra Design Patterns - Sample Chapter
No ratings yet
Cassandra Design Patterns - Sample Chapter
32 pages
Examview Setup Information - Notes For Et - Sept2013
No ratings yet
Examview Setup Information - Notes For Et - Sept2013
3 pages
Cassandra Data Model
No ratings yet
Cassandra Data Model
17 pages
4 - Key-Value Storage
No ratings yet
4 - Key-Value Storage
109 pages
Dzone Refcard 153 Apache Cassandra 2020
No ratings yet
Dzone Refcard 153 Apache Cassandra 2020
11 pages
Lec 17
No ratings yet
Lec 17
21 pages
Cassandra Data Modeling Best Practices
No ratings yet
Cassandra Data Modeling Best Practices
57 pages
Cassandra CQL Commands
No ratings yet
Cassandra CQL Commands
16 pages
Oracle Partitioning For Developers
No ratings yet
Oracle Partitioning For Developers
70 pages
Collabera Corporate
No ratings yet
Collabera Corporate
13 pages
Introduction To Cassandra
No ratings yet
Introduction To Cassandra
47 pages
Introduction to Cassandra Basics
No ratings yet
Introduction to Cassandra Basics
27 pages
NOSQL Databases
No ratings yet
NOSQL Databases
19 pages
Cassandra
No ratings yet
Cassandra
5 pages
Ch3 Nosql Wordpress
No ratings yet
Ch3 Nosql Wordpress
15 pages
02 CQL - Solution
No ratings yet
02 CQL - Solution
3 pages
CICS Basics
No ratings yet
CICS Basics
25 pages
Cassandra
No ratings yet
Cassandra
31 pages
Become A Super Modeler
No ratings yet
Become A Super Modeler
29 pages
Casandra
No ratings yet
Casandra
57 pages
PR 5 - No SQL
No ratings yet
PR 5 - No SQL
9 pages
Deep Dive Dynamo DB
No ratings yet
Deep Dive Dynamo DB
88 pages
Apache Cassandra Tutorial
No ratings yet
Apache Cassandra Tutorial
7 pages
Handbook For Technical Recruitment
No ratings yet
Handbook For Technical Recruitment
18 pages
Knockout Js
100% (1)
Knockout Js
18 pages
Intro To NoSQL
No ratings yet
Intro To NoSQL
18 pages
Features of Cassandra
No ratings yet
Features of Cassandra
6 pages
BDA
No ratings yet
BDA
9 pages
Bapi and Badi
No ratings yet
Bapi and Badi
8 pages
Apache Cassandra Nosql SonuJha 04
No ratings yet
Apache Cassandra Nosql SonuJha 04
14 pages
Intro To Cassandra and CQL
No ratings yet
Intro To Cassandra and CQL
29 pages
IoT USSD API Developers Guide
No ratings yet
IoT USSD API Developers Guide
27 pages
Distributed Data Store
No ratings yet
Distributed Data Store
11 pages
4 Key Value
No ratings yet
4 Key Value
30 pages
Cassandra Data Modeling Guide
No ratings yet
Cassandra Data Modeling Guide
50 pages
Rangkum Handson
No ratings yet
Rangkum Handson
20 pages
Lecture7 Cassandra Animations
No ratings yet
Lecture7 Cassandra Animations
20 pages
09b Cassandra Slides
No ratings yet
09b Cassandra Slides
26 pages
Module 4
No ratings yet
Module 4
22 pages
Apache Cassandra Database - Instaclustr
No ratings yet
Apache Cassandra Database - Instaclustr
8 pages
App Ache
No ratings yet
App Ache
55 pages
Chapter 4 - What Is System Testing
No ratings yet
Chapter 4 - What Is System Testing
4 pages
Cassandra - Module5
No ratings yet
Cassandra - Module5
37 pages
Whitepaper - Data Modeling in Apache Cassandra
No ratings yet
Whitepaper - Data Modeling in Apache Cassandra
21 pages
DSX Developer Ebook4 FINAL PDF
No ratings yet
DSX Developer Ebook4 FINAL PDF
27 pages
Cassandra Database Overview
No ratings yet
Cassandra Database Overview
37 pages
Apache Cassandra: Database
No ratings yet
Apache Cassandra: Database
55 pages
Qlik Analytics Introduction - Power
No ratings yet
Qlik Analytics Introduction - Power
25 pages
Pivot Exercise 2
No ratings yet
Pivot Exercise 2
3 pages
Kofax Insight: Installation Guide
No ratings yet
Kofax Insight: Installation Guide
58 pages
Lib Sys
No ratings yet
Lib Sys
48 pages
Annual Report Page-3
No ratings yet
Annual Report Page-3
1 page
Garima Bhatt: Personal Profile
No ratings yet
Garima Bhatt: Personal Profile
1 page
Microsoft Access 2007 Tutorial Guide
100% (1)
Microsoft Access 2007 Tutorial Guide
4 pages
New Released Microsoft 70-532 Dumps PDF Free Download From Braindump2go (1-10)
No ratings yet
New Released Microsoft 70-532 Dumps PDF Free Download From Braindump2go (1-10)
10 pages
Database Systems Lab 3 Key Constraints
No ratings yet
Database Systems Lab 3 Key Constraints
4 pages
2020 Microsoft Azure Fundamentals AZ 900
No ratings yet
2020 Microsoft Azure Fundamentals AZ 900
7 pages
6.integration Testing
No ratings yet
6.integration Testing
6 pages
SAP UI5 Component Load Error
No ratings yet
SAP UI5 Component Load Error
4 pages

Intro To Cassandra For Developers

Uploaded by

Intro To Cassandra For Developers

Uploaded by

Intro to Cassandra for Developers

Questions: bit.ly/cassandra-workshop Quizz: menti.com

Global Scale No Operations 25 Gig Free Tier

- Big Data Ready

USA New York 8.000.000

UK London 9.200.000 JP Tokyo 37.430.000

AU Sydney 4.900.000 CA Toronto 6.200.000

• Geographic Distribution • Hybrid-Cloud and Multi-Cloud

No Data Loss Caching Pricing

Global Presence Banking Retail

Modern Cloud API Layer Hybrid-cloud

2. The Art of Data Modelling

2. The Art of Data Modelling

A group of rows having the ID First Name Last Name Department

IMPORTANT: stored together, all 399 Marisha Chapez Wizardry

ID First Name Last Name Department

1 John Doe Wizardry

3 Patrick McFadin DevRel

Table ● Tabular data model, with one twist

Partition key column Clustering columns Data columns

CREATE TABLE killrvideo.users_by_city (

Primary key Partition key Clustering columns

MUST ENSURE UNIQUENESS.

PRIMARY KEY ((city), last_name, first_name, email);

PRIMARY KEY (user_id);

Partition key Clustering columns

PRIMARY KEY (user_id);

PRIMARY KEY ((video_id), comment_id);

Partition key Clustering columns

PRIMARY KEY ((city), last_name, first_name); Not Unique

PRIMARY KEY ((city), last_name, first_name, email);

PRIMARY KEY ((video_id), comment_id); Not Sorted

PRIMARY KEY ((video_id), created_at, comment_id);

Example: open a video? Get the comments in a single query!

PRIMARY KEY ((video_id), created_at, comment_id);

PRIMARY KEY ((comment_id), created_at);

PRIMARY KEY ((video_id), created_at, comment_id);

PRIMARY KEY ((country), user_id);

● Up to 2 billion cells per partition

Example: a huge IoT infrastructure, hardware all over

PRIMARY KEY ((sensor_id), reported_at);

Example: a huge IoT infrastructure, hardware all over

PRIMARY KEY ((sensor_id), reported_at);

PRIMARY KEY ((sensor_id, month_year), reported_at);

PRIMARY KEY (user_id);

PRIMARY KEY ((video_id), created_at, comment_id);

PRIMARY KEY ((country), user_id);

2. The Art of Data Modelling

3 Sage Lahja Math

PROS: Quick Read, Simple Queries 4 Juniper Jones Botany

2. Identify entities, their properties

3. Design tables, using

4. Use JOIN when doing queries to

2. Identify workﬂows, their

3. Deﬁne Queries to fulﬁll these Models

4. Knowing the queries, design tables,

5. Use BATCH when inserting or

Query I: Find comments posted for a user comments_by_user

Query II: Find comments for a video with a comments_by_video

SELECT * FROM comments_by_user comments_by_user

WHERE userid = <some UUID>

SELECT * FROM comments_by_video comments_by_video

userid UUID K videoid UUID K

comment TEXT comment TEXT

CREATE TABLE IF NOT EXISTS comments_by_video (

2. The Art of Data Modelling

Classic courses available at DataStax Academy

You might also like