Applies to PostgreSQL versions 14 to 17.
Step 1: Create Non-Partitioned Source Table and Insert Sample Data
DROP TABLE IF EXISTS sales_raw;
CREATE TABLE sales_raw (
    id serial PRIMARY KEY,
    sale_date date NOT NULL,
    customer_name text,
    amount numeric
);
-- Insert sample data covering 2020 through 2024
INSERT INTO sales_raw (sale_date, customer_name, amount)
SELECT
    day,
    'Customer_' || (random()*100)::int,
    (random()*1000)::numeric(10,2)
FROM generate_series('2020-01-01'::date, '2024-12-31'::date, '1 day') day,
     generate_series(1, 50); -- 1,827 days x 50 = 91,350 rows
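To sanity-check the load before migrating:
SELECT count(*) AS total_rows,
       min(sale_date) AS first_day,
       max(sale_date) AS last_day
FROM sales_raw;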
Step 2: Create Partitioned Table with Composite Primary Key
PostgreSQL requires every unique constraint on a partitioned table to include the partition key, which is why the primary key here is (sale_date, id) rather than id alone.
CREATE TABLE sales_raw_new
(
    id int NOT NULL,
    sale_date date NOT NULL,
    customer_name text,
    amount numeric,
    PRIMARY KEY (sale_date, id)
)
PARTITION BY RANGE (sale_date);
Step 3: Create Yearly Partitions (2020–2024)
CREATE TABLE sales_raw_2020 PARTITION OF sales_raw_new
    FOR VALUES FROM ('2020-01-01') TO ('2021-01-01');
CREATE TABLE sales_raw_2021 PARTITION OF sales_raw_new
    FOR VALUES FROM ('2021-01-01') TO ('2022-01-01');
CREATE TABLE sales_raw_2022 PARTITION OF sales_raw_new
    FOR VALUES FROM ('2022-01-01') TO ('2023-01-01');
CREATE TABLE sales_raw_2023 PARTITION OF sales_raw_new
    FOR VALUES FROM ('2023-01-01') TO ('2024-01-01');
CREATE TABLE sales_raw_2024 PARTITION OF sales_raw_new
    FOR VALUES FROM ('2024-01-01') TO ('2025-01-01');
Note: PostgreSQL range partitions use half-open intervals, [from, to). The FROM bound is inclusive and the TO bound is exclusive, which is why each yearly partition ends at January 1 of the following year.
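Optionally (my own addition, not required by the steps that follow), a DEFAULT partition catches rows that fall outside all defined ranges instead of raising an error:
CREATE TABLE sales_raw_default PARTITION OF sales_raw_new DEFAULT;
Be aware that rows sitting in a default partition can block a later ATTACH of an overlapping range (Step 12), so empty or drop it before reattaching archived partitions.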
Step 4: Optional Indexes on Partitions
CREATE INDEX idx_sales_2020_id ON sales_raw_2020(id);
CREATE INDEX idx_sales_2021_id ON sales_raw_2021(id);
CREATE INDEX idx_sales_2022_id ON sales_raw_2022(id);
CREATE INDEX idx_sales_2023_id ON sales_raw_2023(id);
CREATE INDEX idx_sales_2024_id ON sales_raw_2024(id);
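Since PostgreSQL 11, one statement on the parent achieves the same thing: an index created on a partitioned table is cascaded to every partition, including ones added later. The index name idx_sales_id below is my own choice:
CREATE INDEX idx_sales_id ON sales_raw_new (id);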
Step 5: Create Backfill Log Table
DROP TABLE IF EXISTS backfill_log;
CREATE TABLE backfill_log (
    id serial PRIMARY KEY,
    batch_no int,
    last_id int,
    rows_copied int,
    started_at timestamp,
    ended_at timestamp,
    status text
);
Step 6: Run Batch Backfill with Logging (Idempotent, Safe to Re-run)
DO $$
DECLARE
    batch_size    int := 10000;
    last_id       int := 0;
    rows_selected int;
    rows_copied   int;
    batch_no      int := 0;
    batch_start   timestamp;
BEGIN
    LOOP
        batch_no := batch_no + 1;
        batch_start := clock_timestamp();

        WITH to_insert AS (
            SELECT id, sale_date, customer_name, amount
            FROM sales_raw
            WHERE id > last_id
              AND sale_date >= '2020-01-01' AND sale_date < '2025-01-01'
            ORDER BY id
            LIMIT batch_size
        ),
        inserted AS (
            INSERT INTO sales_raw_new (id, sale_date, customer_name, amount)
            SELECT * FROM to_insert
            ON CONFLICT (sale_date, id) DO NOTHING
            RETURNING id
        )
        SELECT (SELECT max(id)  FROM to_insert),
               (SELECT count(*) FROM to_insert),
               (SELECT count(*) FROM inserted)
        INTO last_id, rows_selected, rows_copied;

        -- Exit when the source is exhausted, not when a batch happens to be
        -- fully conflicting; otherwise a re-run would stop at the first
        -- already-copied batch instead of skipping past it.
        IF rows_selected = 0 THEN
            EXIT;
        END IF;

        INSERT INTO backfill_log (batch_no, last_id, rows_copied, started_at, ended_at, status)
        VALUES (batch_no, last_id, rows_copied, batch_start, clock_timestamp(), 'done');

        PERFORM pg_sleep(0.2); -- Optional pause to reduce I/O pressure
    END LOOP;
END $$;
Each batch advances by the highest id selected (keyset pagination), so an interrupted run can simply be restarted and will skip past rows that are already in place.
Step 7: Monitor Progress (in another session)
SELECT * FROM backfill_log ORDER BY id DESC LIMIT 10;
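For an overall total rather than per-batch rows:
SELECT sum(rows_copied) AS total_copied,
       max(ended_at) AS last_batch_finished
FROM backfill_log;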
Step 8: Set Up Live Trigger Sync (for changes during migration)
On a live system, create this trigger before running the Step 6 backfill so no concurrent change is missed; the ON CONFLICT clauses make the trigger and the backfill safe to overlap.
CREATE OR REPLACE FUNCTION trg_sync_sales_raw()
RETURNS TRIGGER AS $$
BEGIN
    IF TG_OP = 'INSERT' THEN
        INSERT INTO sales_raw_new (id, sale_date, customer_name, amount)
        VALUES (NEW.id, NEW.sale_date, NEW.customer_name, NEW.amount)
        ON CONFLICT (sale_date, id) DO NOTHING;
    ELSIF TG_OP = 'UPDATE' THEN
        -- Delete-then-insert handles updates that move a row to another partition
        DELETE FROM sales_raw_new WHERE sale_date = OLD.sale_date AND id = OLD.id;
        INSERT INTO sales_raw_new (id, sale_date, customer_name, amount)
        VALUES (NEW.id, NEW.sale_date, NEW.customer_name, NEW.amount)
        ON CONFLICT (sale_date, id) DO NOTHING;
    ELSIF TG_OP = 'DELETE' THEN
        DELETE FROM sales_raw_new WHERE sale_date = OLD.sale_date AND id = OLD.id;
    END IF;
    RETURN NULL; -- AFTER trigger; the return value is ignored
END;
$$ LANGUAGE plpgsql;
CREATE TRIGGER trg_sync_all
    AFTER INSERT OR UPDATE OR DELETE ON sales_raw
    FOR EACH ROW EXECUTE FUNCTION trg_sync_sales_raw();
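A quick way to confirm the sync works (the test row and name are mine; remove it afterwards):
INSERT INTO sales_raw (sale_date, customer_name, amount)
VALUES ('2024-06-01', 'Trigger_Test', 1.00);

SELECT * FROM sales_raw_new WHERE customer_name = 'Trigger_Test';

DELETE FROM sales_raw WHERE customer_name = 'Trigger_Test'; -- the trigger removes it from sales_raw_new too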
Step 9: Final Switchover
-- Keep the old table around under a different name
ALTER TABLE sales_raw RENAME TO sales_raw_old;
-- Activate the partitioned version under the original name
ALTER TABLE sales_raw_new RENAME TO sales_raw;
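A minimal sketch of a safer switchover, assuming the serial column's default sequence is named sales_raw_id_seq (PostgreSQL's standard naming). It drops the sync trigger, performs both renames atomically, and re-points the sequence so inserts into the new table keep generating ids:
BEGIN;

-- The sync trigger would follow the renamed table and start failing; drop it first
DROP TRIGGER trg_sync_all ON sales_raw;

ALTER TABLE sales_raw RENAME TO sales_raw_old;
ALTER TABLE sales_raw_new RENAME TO sales_raw;

-- The new table has no default for id; reuse the old serial sequence
ALTER TABLE sales_raw ALTER COLUMN id SET DEFAULT nextval('sales_raw_id_seq');
-- Re-own the sequence so dropping sales_raw_old (Step 10) does not drop it
ALTER SEQUENCE sales_raw_id_seq OWNED BY sales_raw.id;

COMMIT;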
Step 10: (Optional) Clean Up
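Before dropping anything, it is worth verifying that old and new agree. One simple check (my own, and slow on very large tables) compares row counts:
SELECT
    (SELECT count(*) FROM sales_raw_old) AS old_rows,
    (SELECT count(*) FROM sales_raw)     AS new_rows;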
-- Drop old table after verification
DROP TABLE sales_raw_old;
Step 11: Detach and Archive Old Partition (e.g., 2020)
Detaching a partition (e.g., sales_raw_2020) removes it from the main table but preserves the table and its data:
ALTER TABLE sales_raw DETACH PARTITION sales_raw_2020;
On PostgreSQL 14 and later, ALTER TABLE ... DETACH PARTITION ... CONCURRENTLY avoids blocking concurrent queries during the detach.
Now sales_raw_2020 is a standalone table.
Optional: Archive the Detached Table
You can export it using tools like:
a) pg_dump (custom-format dump, per the -F c flag):
pg_dump -t sales_raw_2020 -F c -f /home/postgres/backup/sales_raw_2020.backup your_db
b) CSV export:
COPY sales_raw_2020 TO '/home/postgres/backup/sales_raw_2020.csv' CSV HEADER;
(Server-side COPY writes the file as the database server's OS user; from a client machine, use psql's \copy instead.)
Then:
- Move the backup to AWS S3 or Glacier
- Drop the detached table locally (if desired):
DROP TABLE sales_raw_2020;
Step 12: ATTACH Old Partition (e.g., 2020)
Restore Later When Needed
If users request archived data, you can reattach it as a partition:
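If the table was dropped after archiving, first restore it from the custom-format dump taken in Step 11 (same hypothetical paths and database name):
pg_restore -d your_db /home/postgres/backup/sales_raw_2020.backup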
-- Ensure it still matches structure
ALTER TABLE sales_raw ATTACH PARTITION sales_raw_2020
    FOR VALUES FROM ('2020-01-01') TO ('2021-01-01');
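After reattaching, partition pruning should once again route 2020 queries to the restored partition; the plan should show only sales_raw_2020 being scanned:
EXPLAIN SELECT * FROM sales_raw
WHERE sale_date BETWEEN '2020-03-01' AND '2020-03-31';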
Thank you for reading.
PostgreSQL is open source and full of possibilities, and I'm happy to discuss its features further.
Disclaimer:
The information provided here is based on my personal knowledge, experience, and publicly
available sources.
https://www.linkedin.com/in/mariyanclement