Mark Raasveldt
Parallel Quacking
Parallel Quacking
▸ When building DuckDB we have mostly
focused on building a functional system
▸ Avoid premature optimization
▸ Avoid adding optimizations that prevent
adding features
Parallel Quacking
▸ Suddenly people are benchmarking our system
▸ Including benchmarks in research papers
▸ Yikes!
▸ We haven’t exactly spent a lot of time
optimizing…
Parallel Quacking
▸ We are now pretty happy with functionality
▸ Window functions, subqueries, collations,
(recursive) CTEs, Parquet/Pandas/CSV
readers, …
▸ Maybe we should start optimizing!
Parallel Quacking
▸ DuckDB is currently single-threaded
▸ Parallelism is an obvious performance boost
▸ More importantly: parallelism requires a
structural change to the code
▸ Optimizations need to account for parallelism
▸ Optimizing a single-threaded HT is pointless if
we have to throw it away once we add
parallelism!
Parallel Quacking
▸ Parallelism is actually our oldest open issue!
▸ Created one month after the initial commit
▸ So it’s about time :)
DBMS Parallelism
▸ Short intro to DBMS parallelism
▸ DBMS have two types of parallelism
▸ Inter-query and intra-query parallelism
▸ Inter-query: multiple different queries
can be executed in parallel
▸ Intra-query: a single query can be
parallelized
DBMS Parallelism
▸ Most systems have inter-query
▸ We already had this
▸ Most useful for OLTP systems
▸ Many concurrent client requests, etc.
DBMS Parallelism
▸ Intra-query is not part of most OLTP
systems
▸ e.g. MySQL/PostgreSQL/SQLite
▸ Not useful for small queries
▸ Only useful for complex queries
▸ Aka OLAP systems
DBMS Parallelism
▸ Exchange operator: the original way of
doing parallelism
▸ Parallelism is encapsulated in the
exchange operator
▸ All other ops are unaware of parallelism
▸ Easy to bolt onto existing systems
[1993] Encapsulation of Parallelism and
Architecture-Independence in Extensible
Database Query Execution
Goetz Graefe et al.
DBMS Parallelism
▸ MonetDB uses a system similar to the exchange
operator
▸ Individual ops are parallelism-unaware
▸ Data is partitioned by mitosis (mergetable?)
▸ Ops execute sequentially on partitions
▸ Result is combined by mat.pack
DBMS Parallelism
▸ Exchange operator works to parallelize queries
▸ It is nice to bolt on to an existing system
▸ Don’t need to change any operators!
▸ But has partitioning/merging overhead…
▸ Works well for certain queries¹, not for many
others
▸ ¹ ungrouped aggregates or aggregates with a
small number of groups
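The partition/operate/merge pattern of the exchange operator can be sketched in a few lines. This is an illustrative sketch only (the function names `partition`, `sequential_sum`, and `exchange_sum` are hypothetical, not MonetDB or DuckDB code): the operator itself stays parallelism-unaware, and all parallelism lives in the partition and merge steps around it.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical sketch of exchange-style parallelism: partition the data,
# run a parallelism-unaware operator on each partition, merge the results
# (compare MonetDB's mitosis for partitioning and mat.pack for merging).

def partition(data, n):
    """Split data into n roughly equal partitions (the 'exchange' step)."""
    size = (len(data) + n - 1) // n
    return [data[i * size:(i + 1) * size] for i in range(n)]

def sequential_sum(part):
    # The operator is unaware that it runs on a partition.
    return sum(part)

def exchange_sum(data, n_threads=4):
    parts = partition(data, n_threads)
    with ThreadPoolExecutor(max_workers=n_threads) as pool:
        partials = list(pool.map(sequential_sum, parts))
    return sum(partials)  # merge step

total = exchange_sum(list(range(1000)))
# total == 499500
```

Note that the partition and merge steps are pure overhead compared to a single sequential pass, which is exactly the cost the slide refers to.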
Morsel-Driven Parallelism
▸ Alternative: Morsel-driven parallelism
▸ Parallelism-aware operators
▸ Query is divided into pipelines
▸ Those pipelines are executed in parallel
[2014] Morsel-Driven Parallelism: A
NUMA-Aware Query Evaluation
Framework for the Many-Core Age
Viktor Leis et al.
Morsel-Driven Parallelism
SELECT …
FROM S
JOIN R USING (A)
JOIN T USING (B);

1: HT Build “T”
2: HT Build “S”
3: Probe HTs and output result (depends on 1 and 2)
Morsel-Driven Parallelism
SELECT …
FROM S
JOIN R USING (A)
JOIN T USING (B);
HT Build “T”
HT Build “S”
▸ HT builds of S and T can be trivially parallelized
▸ No shared data
▸ Limited parallelizability: depends on query complexity…
Morsel-Driven Parallelism
▸ Need to parallelize inside a pipeline
▸ How to do that?
▸ Contention happens at the endpoints
▸ Scan of T
▸ HT build at the join
▸ Use parallelism-aware operators at the endpoints
▸ The rest of the operators (HT probe, projection,
filter, etc.) don’t need to be aware
Morsel-Driven Parallelism
[Figure: speedups on TPC-H SF100, 32 cores (64 hardware threads)]
[2014] Morsel-Driven Parallelism: A
NUMA-Aware Query Evaluation
Framework for the Many-Core Age
Viktor Leis et al.
Morsel-Driven Vegetable Soup
▸ Morsel-driven parallelism seems like the way to go
▸ How can we add it to our vegetable soup?
Parallelism in DuckDB
▸ DuckDB uses a pull-based Volcano execution model
▸ “Vector Volcano”
▸ Every operator implements a GetChunk method
▸ Recursively calls GetChunk on its children
▸ Until we reach a data source (e.g. a table scan)
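The pull-based model can be sketched as follows. This is an illustrative sketch, not DuckDB's actual operator API: the classes `TableScan` and `Filter` and the method `get_chunk` are hypothetical stand-ins for the GetChunk mechanism described above.

```python
# Illustrative sketch of a pull-based ("Vector Volcano") execution model:
# every operator exposes get_chunk and recursively pulls from its child
# until a data source is reached.

class TableScan:
    def __init__(self, chunks):
        self.chunks = list(chunks)
        self.pos = 0

    def get_chunk(self):
        # Data source: emit one chunk at a time until exhausted.
        if self.pos >= len(self.chunks):
            return None
        chunk = self.chunks[self.pos]
        self.pos += 1
        return chunk

class Filter:
    def __init__(self, child, predicate):
        self.child = child
        self.predicate = predicate

    def get_chunk(self):
        # Recursively pull from the child until a non-empty chunk survives.
        while True:
            chunk = self.child.get_chunk()
            if chunk is None:
                return None
            filtered = [row for row in chunk if self.predicate(row)]
            if filtered:
                return filtered

# Usage: SELECT * FROM t WHERE x > 2, over two chunks of integers.
plan = Filter(TableScan([[1, 2, 3], [4, 5]]), lambda x: x > 2)
result = []
while (chunk := plan.get_chunk()) is not None:
    result.extend(chunk)
# result == [3, 4, 5]
```

The recursion stops at the table scan, which is why splitting a query into pipelines has to break this pull chain at operators that consume their entire input.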
Parallelism in DuckDB
▸ BuildHashTable: pull everything from RHS (build-side)
▸ ProbeHashTable: pull single chunk from LHS (probe
side)
Parallelism in DuckDB
▸ Have to split up building from probing
▸ Create individual pipelines
▸ Design an interface that allows for parallelism-aware execution
Parallelism in DuckDB
▸ Contention is in the source and sink of a pipeline
▸ Most difficult contention is in the sink
▸ Splitting up a scan is relatively simple
Parallelism in DuckDB
▸ Sink Interface
▸ Sink has two states
▸ Global state: single state per sink
▸ Local state: single state per thread
▸ Actual content depends on the operator
Parallelism in DuckDB
▸ Sink Interface
▸ Sink takes as input the two states + a DataChunk
▸ Called repeatedly until the source data is exhausted
Parallelism in DuckDB
▸ Sink Interface
▸ Combine is called after a single thread’s source is
exhausted
▸ Combine is the final chance to merge any changes
in the local sink state into the global state
Parallelism in DuckDB
▸ Sink Interface
▸ Finalize is called after all tasks related to the sink
are completed
Parallelism in DuckDB
▸ Example: Ungrouped Aggregate
▸ Global state holds the aggregate result, and a lock
Parallelism in DuckDB
▸ Example: Ungrouped Aggregate
▸ Local state holds a thread-local aggregate, and
some intermediates
Parallelism in DuckDB
▸ Example: Ungrouped Aggregate
▸ Sink: Aggregate into thread-local aggregation
Parallelism in DuckDB
▸ Example: Ungrouped Aggregate
▸ Combine: Merge local state into global state
Parallelism in DuckDB
▸ Example: Ungrouped Aggregate
▸ Finalize: Nothing, we are done
▸ (both Combine and Finalize are optional)
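The ungrouped-aggregate example above can be sketched end to end. This is an illustrative Python sketch, not DuckDB's actual C++ code: the class `UngroupedSum` and its method names are hypothetical, but the structure follows the Sink/Combine/Finalize interface described on the previous slides.

```python
import threading

# Hypothetical sketch of a parallel ungrouped aggregate using the
# Sink/Combine/Finalize interface: Sink aggregates into thread-local
# state without locking; Combine merges into the global state under a
# lock; Finalize has nothing left to do.

class UngroupedSum:
    def __init__(self):
        self.lock = threading.Lock()   # global state: result + lock
        self.result = 0

    def make_local_state(self):
        return {"sum": 0}              # thread-local aggregate

    def sink(self, local, chunk):
        # Sink: aggregate into the thread-local state, no locking needed.
        local["sum"] += sum(chunk)

    def combine(self, local):
        # Combine: merge the local state into the global state once.
        with self.lock:
            self.result += local["sum"]

    def finalize(self):
        # Finalize: nothing to do for an ungrouped aggregate.
        return self.result

def run_thread(op, chunks):
    local = op.make_local_state()
    for chunk in chunks:          # called repeatedly until exhausted
        op.sink(local, chunk)
    op.combine(local)             # merge exactly once per thread

op = UngroupedSum()
partitions = [[[1, 2], [3]], [[4, 5]], [[6]]]   # chunks per thread
threads = [threading.Thread(target=run_thread, args=(op, p)) for p in partitions]
for t in threads: t.start()
for t in threads: t.join()
# op.finalize() == 21
```

The key property is that the lock is taken once per thread (in Combine), not once per chunk, so contention stays low regardless of input size.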
Parallelism in DuckDB
▸ Splitting up scans
▸ Splitting up scans is generally not very difficult
▸ But we have multiple types of scans
▸ Base table, parquet, CSV, aggregate HT, etc…
▸ How to split up depends on scan type
Parallelism in DuckDB
▸ Interface for parallel scans:
▸ One task is created for every invoked callback
▸ Implementation is optional
▸ No implementation -> scan will not be parallelized
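The base-table split described on the next slide (one task per 100 vectors of 1,024 tuples each) can be sketched like this. The function `create_scan_tasks` is a hypothetical illustration of the idea, not DuckDB's actual scan interface.

```python
# Hypothetical sketch of splitting a base-table scan into parallel tasks:
# one task per fixed block of vectors.

VECTOR_SIZE = 1024          # tuples per vector
VECTORS_PER_TASK = 100      # one task per 100 vectors (102,400 tuples)

def create_scan_tasks(total_tuples):
    """Return (start, end) tuple ranges, one per scan task."""
    tuples_per_task = VECTOR_SIZE * VECTORS_PER_TASK
    tasks = []
    start = 0
    while start < total_tuples:
        end = min(start + tuples_per_task, total_tuples)
        tasks.append((start, end))
        start = end
    return tasks

tasks = create_scan_tasks(250_000)
# 250,000 tuples -> two full tasks of 102,400 tuples plus one remainder
```

Each range becomes one scheduled task; a scan type that provides no such callback simply runs as a single task.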
Parallelism in DuckDB
▸ Currently only implemented for base table
▸ One task for every 100 vectors (102,400 tuples)
▸ Parquet/Pandas is not very complicated
▸ CSV can also benefit…
▸ Future work!
Parallelism in DuckDB
▸ Creating the pipelines
▸ Created by a single traversal of the query tree
▸ Encounter a pipeline breaker: create a new
pipeline
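The single traversal described above can be sketched as a recursive walk that starts a new pipeline at each pipeline breaker. This is an illustrative sketch, not DuckDB's actual planner code: `Op` and `build_pipelines` are hypothetical, and it assumes (as the next slides note) that the hash join builds on its right child.

```python
# Hypothetical sketch: one traversal of the query tree that creates a new
# pipeline (plus a dependency) at every pipeline breaker, here the build
# side of a hash join.

class Op:
    def __init__(self, name, children=()):
        self.name = name
        self.children = list(children)

def build_pipelines(op, current, pipelines):
    if op.name == "hash_join":
        probe, build = op.children  # build on the RHS
        # The build side is a pipeline breaker: create a child pipeline
        # and record it as a dependency of the current pipeline.
        child = {"ops": [], "deps": []}
        pipelines.append(child)
        current["deps"].append(child)
        build_pipelines(build, child, pipelines)
        build_pipelines(probe, current, pipelines)
    else:
        for c in op.children:
            build_pipelines(c, current, pipelines)
    current["ops"].append(op.name)

# Plan shape for: ... FROM S JOIN R USING (A) JOIN T USING (B)
plan = Op("hash_join", [
    Op("hash_join", [Op("scan_R"), Op("scan_S")]),
    Op("scan_T"),
])
main = {"ops": [], "deps": []}
pipelines = [main]
build_pipelines(plan, main, pipelines)
# Three pipelines: the main probe pipeline plus one build pipeline per join.
```

The two build pipelines have no dependencies of their own, so they can run in parallel; the main pipeline waits on both, matching the dependency picture from the morsel-driven slides.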
Parallelism in DuckDB
SELECT …
FROM S
JOIN R USING (A)
JOIN T USING (B);

Encounter hash join: create build pipeline in RHS*
and create a dependency in the main pipeline

* This image is taken from HyPer, which builds on the LHS - we build on the RHS.
Is there a standard? Should we switch this? Is it even important?
Parallelism in DuckDB
SELECT …
FROM S
JOIN R USING (A)
JOIN T USING (B);

Another hash join: create another
build pipeline and dependency
Parallelism in DuckDB
TPC-H Q1
[Figure: query profile]
P1 (depends on P2): scans the aggregate HT!
P2
The 0 shown is a bug in our profiler with parallel execution at the moment (TODO)
Parallelism in DuckDB
▸ Notes on parallelism
▸ The final pipeline (i.e. the one that outputs
results) is not parallelized
▸ Doesn’t matter for TPC-H (there is always a Top-N
or ORDER BY…)
▸ But can definitely matter for other queries!
▸ We can push a “materialize” operator that
materializes in parallel
▸ Future work!
Parallelism in DuckDB
▸ Notes on load balancing
▸ Pipelines are split into tasks
▸ Tasks are scheduled in a concurrent queue
▸ Worker threads work on these tasks in scheduled
order
▸ Except the calling thread: this thread works on its
own query
▸ Short queries will not have to wait for long queries
▸ Every query has at least one thread working on it
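The scheduling scheme above can be sketched with a shared concurrent queue. This is an illustrative sketch of the idea, not DuckDB's actual scheduler: the queue, the sentinel-based shutdown, and the task names are all hypothetical.

```python
import queue
import threading

# Hypothetical sketch of the load-balancing scheme: pipeline tasks from
# all queries go into one shared concurrent queue, and worker threads
# drain it in scheduled order.

task_queue = queue.Queue()
results = []
results_lock = threading.Lock()

def make_task(query_id, part):
    def task():
        with results_lock:
            results.append((query_id, part))
    return task

def worker():
    while True:
        task = task_queue.get()
        if task is None:   # sentinel: shut this worker down
            return
        task()

# Schedule tasks from two queries into the shared queue.
for part in range(3):
    task_queue.put(make_task("short_query", part))
    task_queue.put(make_task("long_query", part))

threads = [threading.Thread(target=worker) for _ in range(4)]
for t in threads: t.start()
for _ in threads: task_queue.put(None)
for t in threads: t.join()
# All six tasks ran; completion order across workers is nondeterministic.
```

Because tasks from different queries interleave in the queue, a short query's few tasks are not stuck behind a long query's many tasks, which is the point of the slide. The sketch omits the detail that the calling thread only works on its own query.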
Parallelism in DuckDB
▸ NUMA Awareness
▸ TODO :)
Preliminary Results
▸ Results
▸ Before we implemented splitting of scans we were
curious
▸ How much does TPC-H benefit from inter-pipeline
parallelization?
Preliminary Results
▸ Small speedup in some queries
▸ Most queries are dominated by a single pipeline!
* Actually 3 threads, due to an off-by-one :)
Preliminary Results
▸ Preliminary results (including splitting of pipelines)
▸ Notes
▸ We did not implement a good aggregate HT yet!
▸ Currently global HT that is locked on every sink
▸ Join HT/scan also have a (small) amount of contention
▸ Did not have much time to look at it yet
▸ This was all finished last Thursday :)
Preliminary Results
▸ Preliminary results
Preliminary Results
▸ Q1
[Figures: parallel vs. sequential query profiles]
Preliminary Results
▸ Q18 Sequential
Preliminary Results
▸ Q18 Parallel
Future Work
▸ Future Work
▸ Rework aggregate hash table
▸ More profiling of contention (specifically in scans)
▸ Parallel window functions, ORDER BY, Top N…
▸ Parallel Parquet/CSV/Pandas scans
▸ Expand profiler to better display parallelism/
pipelines