0% found this document useful (0 votes)

44 views19 pages

GeostatsPy Spatial Data Declustering

Uploaded by

cesarengmina

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

44 views19 pages

GeostatsPy Spatial Data Declustering

Uploaded by

cesarengmina

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 19

Open Source Spatial Data Analytics in Python with GeostatsPy II

Spatial Uncertainty Modeling with GeostatsPy

Lecture outline . . .

• Spatial Data Declustering

• Interactive Demo with GeostatsPy

• Workflow with GeostatsPy

Michael Pyrcz, The University of Texas at Austin

Motivation

Biased, naïve statistics from biased spatial data samples

result in a biased uncertainty model.

6 Area of Interest
1

7
4 5
y

2
3
Samples

Michael Pyrcz, The University of Texas at Austin

Recorded
Lectures

Michael Pyrcz, The University of Texas at Austin

Open Source Spatial Data Analytics in Python with GeostatsPy II
Spatial Uncertainty Modeling with GeostatsPy

Lecture outline . . .

• Spatial Data Declustering

Michael Pyrcz, The University of Texas at Austin

Spatial Data
Collection
Subsurface data is collected to answer questions:

• how far does the contaminant plume extend? – sample peripheries

• where is the fault? – drill based on seismic interpretation
• what is the highest mineral grade? – sample the best part
• who far does the reservoir extend? – offset drilling

and to maximize value directly:

• maximize production rates

• maximize recovery of a resource
• high grade early value for shorter project pay off period

Michael Pyrcz, The University of Texas at Austin

Clustered
Sample
Let’s make an estimate for an Area / Volume of Interest:
• Inference of the population from a sample.

1 6 Area of Interest

7
4 5
y

2
3
Samples

• To assess the average porosity to calculate OIP

Michael Pyrcz, The University of Texas at Austin

Clustered
Sample

Let’s make an estimate for an Area / Volume of Interest:

1 High 6
7
4 5
y

2
Low 3

• What if we knew from seismic that the reservoir quality

is better in the top left area?
Michael Pyrcz, The University of Texas at Austin
Clustered
Sample

Let’s make an estimate for an Area / Volume of Interest:

8 6
1
9 10
12 7
4 5
11
y

2
3

• What if we kept drilling in the high value region of the

area of interest?
Michael Pyrcz, The University of Texas at Austin
Clustered
Sample

How would our estimate of average porosity change as

we drilled more wells?:
Well Average
Porosity

Sampling Bias

Number of Wells Drilled

• The naïve sample average becomes more biased!

• We need a method to correct for clustered samples.
Michael Pyrcz, The University of Texas at Austin
Some Clustered
Data Exhaustive True Distribution

Here’s data and x-ray vision:

• Location map of 64 wells. with
truth model.
• See the error between the
samples and the underlying truth
model.

Samples and Exhaustive Truth Model Sparse Sample Distribution

Michael Pyrcz, The University of Texas at Austin

Cell
Declustering
Cell Declustering, a method for calculating declustering weights
• divide the volume of interest into a grid of cells 𝑙 = 1, … , 𝐿 count the
occupied cells Lo and the number in each cell 𝑛𝑙 , 𝑙 = 1, … , 𝐿𝑜 , weight
inversely by number in cell (standardize by 𝐿𝑜 )
1 𝑛
Data Weights 𝑤(𝐮𝑗 ) =
𝑛𝑙 𝐿0
1/7 weight x (289 data / 36 cells) = 3.27

𝟏 weight x (289 data / 36 cells) = 1.09

𝟏
weight x (289 data / 36 cells) = 1.63
𝟒

Sum of all weights = n

Nominal / nonclustered weight = 1.0

All data in the same cell get the same weight.

Michael Pyrcz, The University of Texas at Austin
Declustering
Weights

• Declustering weights
1. 1.0 nominal weight
2. < 1.0 reduced weight 1.0

3. > 1.0 increased weight

• Note: some software

programs assume:
𝑛

෍ 𝑤(𝐮𝒊 ) = 1
𝑖
1
then ‘nominal weight’ is
𝑛

Michael Pyrcz, The University of Texas at Austin

Declustered
Distribution

• Updated distribution with

declustering weights

• Now data file / table include values

and paired weights based on spatial
arrangement.

• Possible to calculate any weighted

statistic.

– For example, declustered mean:

σ𝑛𝑖 𝑤(𝐮𝑖 )𝑧(𝐮𝑖 )

𝑧ҧ = 𝑛
σ𝑖 𝑤(𝐮𝒊 ) = 𝑛
• Python MatPlotLib hist allows for a
vector of weights.
Michael Pyrcz, The University of Texas at Austin
Cell-based
Declustering Offsets
• The result is sensitive to exact location of the cell mesh

• This sensitivity is removed by iterativing the mesh position,

calculating the weights for each and averaging the result.

Michael Pyrcz, The University of Texas at Austin

Cell Size
Selection
• Plot declustered mean versus the cell size for a range of cell sizes:

• There is no theory that says we are looking for a minimum when the values are
clustered in high values or a maximum when clustered in low values – it just seems to
make sense
• The result can be very sensitive to large scale trends – it is often better to choose
the cell size by visual inspection and some sensitivity studies
• Could choose the cell size so that there is approximately one datum per cell in the
sparsely sampled areas, the nominal spacing

Michael Pyrcz, The University of Texas at Austin

Open Source Spatial Data Analytics in Python with GeostatsPy II
Spatial Uncertainty Modeling with GeostatsPy

Lecture outline . . .

• Interactive Demo with

GeostatsPy

• Explore the impact of

cell size and cell
offsets

Interactive_Declustering.ipynb
Michael Pyrcz, The University of Texas at Austin
Open Source Spatial Data Analytics in Python with GeostatsPy II
Spatial Uncertainty Modeling with GeostatsPy

Lecture outline . . .

• Workflow with GeostatsPy

Michael Pyrcz, The University of Texas at Austin

Spatial Simulation
Workflow with
GeostatsPy

Let’s walkthrough a more

thorough a spatial data
declustering workflow:

• calculate data weights

• visualize and QC the results

Python Jupyter variogram calculation

(GeostatsPy_declustering.ipynb).

Michael Pyrcz, The University of Texas at Austin

Open Source Spatial Data Analytics in Python with GeostatsPy II
Spatial Uncertainty Modeling with GeostatsPy

Lecture outline . . .

• Spatial Simulation

• Interactive Demo with GeostatsPy

• Workflow with GeostatsPy

Michael Pyrcz, The University of Texas at Austin

Professor Brigitte Le Roux, Professor Henry Rouanet - Multiple Correspondence Analysis (Quantitative Applications in The Social Sciences) - Sage Publications, Inc (2010) PDF
No ratings yet
Professor Brigitte Le Roux, Professor Henry Rouanet - Multiple Correspondence Analysis (Quantitative Applications in The Social Sciences) - Sage Publications, Inc (2010) PDF
126 pages
Declus
No ratings yet
Declus
8 pages
2 Celldeclustering
No ratings yet
2 Celldeclustering
6 pages
S Gems
100% (1)
S Gems
61 pages
Adjusting For Preferential Sampling by Declustering The Data
No ratings yet
Adjusting For Preferential Sampling by Declustering The Data
2 pages
Duvernay Subsurface Modeling Workflow
No ratings yet
Duvernay Subsurface Modeling Workflow
16 pages
NB 13
No ratings yet
NB 13
27 pages
02 IntroGeostatsPy Variogram Modeling
No ratings yet
02 IntroGeostatsPy Variogram Modeling
13 pages
Chapter 06
No ratings yet
Chapter 06
51 pages
Geostatistical Análisis
No ratings yet
Geostatistical Análisis
105 pages
ML 8
No ratings yet
ML 8
5 pages
Unit 3 Unsupervised Learning
No ratings yet
Unit 3 Unsupervised Learning
9 pages
Clustering Fraud Detection
No ratings yet
Clustering Fraud Detection
45 pages
DM Unit 4
No ratings yet
DM Unit 4
12 pages
01 IntroGeostatsPy Variogram Calculation
No ratings yet
01 IntroGeostatsPy Variogram Calculation
14 pages
Spatial Statistics in Geographical Information Science From Interpolation To Probabilistic Robotics
No ratings yet
Spatial Statistics in Geographical Information Science From Interpolation To Probabilistic Robotics
12 pages
Kriging vs. Simulation, A 2D Map Example - GeostatsPy Well-Documented Demonstration Geostatistical Workflows
No ratings yet
Kriging vs. Simulation, A 2D Map Example - GeostatsPy Well-Documented Demonstration Geostatistical Workflows
16 pages
NB 14
No ratings yet
NB 14
15 pages
Unit 4 Cluster Analysis 4
No ratings yet
Unit 4 Cluster Analysis 4
25 pages
1 An Introduction To Machine Learning With Scikit Learn
No ratings yet
1 An Introduction To Machine Learning With Scikit Learn
2 pages
Machine Learning On Geographical Data Using Python
No ratings yet
Machine Learning On Geographical Data Using Python
309 pages
Clustering in Python-Dr. Afsaneh Javadi
No ratings yet
Clustering in Python-Dr. Afsaneh Javadi
8 pages
Gds Scipy16 PDF
No ratings yet
Gds Scipy16 PDF
190 pages
Chapter - 1: 1.1 Overview
No ratings yet
Chapter - 1: 1.1 Overview
50 pages
Data Mining Unit-Iv
No ratings yet
Data Mining Unit-Iv
34 pages
Clustering 2
No ratings yet
Clustering 2
17 pages
Information Sciences: Francesco Gullo, Giovanni Ponti, Andrea Tagarelli, Sergio Greco
No ratings yet
Information Sciences: Francesco Gullo, Giovanni Ponti, Andrea Tagarelli, Sergio Greco
17 pages
Nadir GSLIB
No ratings yet
Nadir GSLIB
55 pages
Enhanced Synthetic Oversampling For Multiclass Imbalanced Data
No ratings yet
Enhanced Synthetic Oversampling For Multiclass Imbalanced Data
20 pages
Reaction Paper On BFR Clustering Algorithm
No ratings yet
Reaction Paper On BFR Clustering Algorithm
5 pages
Agglomerative Mean-Shift Clustering
No ratings yet
Agglomerative Mean-Shift Clustering
7 pages
Clustering in Machine Learning
No ratings yet
Clustering in Machine Learning
4 pages
Unit 4
No ratings yet
Unit 4
5 pages
2011 Csde Esda Exercise
No ratings yet
2011 Csde Esda Exercise
7 pages
Spatial Data Indexing and Queries
No ratings yet
Spatial Data Indexing and Queries
56 pages
Lecture 7.1 Spatial Analysis Raster Data
No ratings yet
Lecture 7.1 Spatial Analysis Raster Data
55 pages
Task 2
No ratings yet
Task 2
3 pages
A Network Flow Model For Biclustering Via Optimal Re-Ordering of Data Matrices
No ratings yet
A Network Flow Model For Biclustering Via Optimal Re-Ordering of Data Matrices
12 pages
Geoprocessing
No ratings yet
Geoprocessing
6 pages
Spatial Panel-Data Models Using Stata: 17, Number 1, Pp. 139-180
No ratings yet
Spatial Panel-Data Models Using Stata: 17, Number 1, Pp. 139-180
42 pages
Gist One Pro Supplement
No ratings yet
Gist One Pro Supplement
9 pages
The Geostatistical Workflow: What Is Geostatistics
No ratings yet
The Geostatistical Workflow: What Is Geostatistics
2 pages
2002 124 CCG
No ratings yet
2002 124 CCG
25 pages
Romary 2015
No ratings yet
Romary 2015
8 pages
Lec 41
No ratings yet
Lec 41
6 pages
Lab 3
No ratings yet
Lab 3
12 pages
Interpretable Clustering: An Optimization Approach: Dimitris Bertsimas Agni Orfanoudaki Holly Wiberg
No ratings yet
Interpretable Clustering: An Optimization Approach: Dimitris Bertsimas Agni Orfanoudaki Holly Wiberg
50 pages
GIS Advance Training Program Content-2
No ratings yet
GIS Advance Training Program Content-2
7 pages
A Parsimonious, Computationally Efficient Machine Learning Method For Spatial Regression
No ratings yet
A Parsimonious, Computationally Efficient Machine Learning Method For Spatial Regression
23 pages
Codigo 2
No ratings yet
Codigo 2
21 pages
Image Segmentation With Kmeans
No ratings yet
Image Segmentation With Kmeans
17 pages
Apriori Algorithm & Clustering Guide
No ratings yet
Apriori Algorithm & Clustering Guide
8 pages
Big Data: An Optimized Approach For Cluster Initialization: Open Access Research
No ratings yet
Big Data: An Optimized Approach For Cluster Initialization: Open Access Research
19 pages
Cheat Sheet-Building Unsupervised Learning Models
No ratings yet
Cheat Sheet-Building Unsupervised Learning Models
3 pages
Tutorial de Clasificación Supervisada de Imágenes de Satétite Con QGIS y R Statistics
No ratings yet
Tutorial de Clasificación Supervisada de Imágenes de Satétite Con QGIS y R Statistics
21 pages
08 Occupancy Mapping
No ratings yet
08 Occupancy Mapping
41 pages
Lab5 Instructions 10.2
No ratings yet
Lab5 Instructions 10.2
12 pages
ML Notes 1
No ratings yet
ML Notes 1
3 pages
Declustering Horizontal Well Data
No ratings yet
Declustering Horizontal Well Data
10 pages
S2 Linear Regression LKW 9march2025
No ratings yet
S2 Linear Regression LKW 9march2025
23 pages
4.introduction To Biostatistics
No ratings yet
4.introduction To Biostatistics
30 pages
6 Anal
No ratings yet
6 Anal
1 page
Savickas Et Al 2009 - A Paradigm For Career Constructing in The 21st Century
100% (2)
Savickas Et Al 2009 - A Paradigm For Career Constructing in The 21st Century
12 pages
Foreign Direct Investment, Information Technology and Economic Growth Dynamics in Sub-Saharan Africa
No ratings yet
Foreign Direct Investment, Information Technology and Economic Growth Dynamics in Sub-Saharan Africa
32 pages
Discussion Assignment Unit 3
No ratings yet
Discussion Assignment Unit 3
6 pages
Pricing and Reserving in The General Insurance Industry
No ratings yet
Pricing and Reserving in The General Insurance Industry
10 pages
Doing Data Science in R An Introduction For Social Scientists - 1st Edition High-Resolution PDF Download
100% (13)
Doing Data Science in R An Introduction For Social Scientists - 1st Edition High-Resolution PDF Download
14 pages
Pearson Edexcel Level 3 Advanced Subsidiary GCE in Mathematics (8MA0) Pearson Edexcel Level 3 Advanced GCE in Mathematics (9MA0)
No ratings yet
Pearson Edexcel Level 3 Advanced Subsidiary GCE in Mathematics (8MA0) Pearson Edexcel Level 3 Advanced GCE in Mathematics (9MA0)
21 pages
Pearson Edexcel GCE As and AL Mathematics Data Set - Issue 1 (1) .Xls - 0
No ratings yet
Pearson Edexcel GCE As and AL Mathematics Data Set - Issue 1 (1) .Xls - 0
149 pages
Industrial Training
No ratings yet
Industrial Training
20 pages
Multiple Linear Regression: Points of Significance
No ratings yet
Multiple Linear Regression: Points of Significance
2 pages
Senior High School (SHS) Subject Offerings Per Track/Strand: St. Camillus College of Manaoag Foundation, Inc
100% (1)
Senior High School (SHS) Subject Offerings Per Track/Strand: St. Camillus College of Manaoag Foundation, Inc
6 pages
Adavanced Qualitative Research Methods Versus Advanced Quantitative Research Methods
No ratings yet
Adavanced Qualitative Research Methods Versus Advanced Quantitative Research Methods
13 pages
Operating Characteristic (OC) Curve
100% (2)
Operating Characteristic (OC) Curve
4 pages
Parameters vs. Statistics Guide
No ratings yet
Parameters vs. Statistics Guide
11 pages
Canonical Correlation Analysis Guide
No ratings yet
Canonical Correlation Analysis Guide
8 pages
ABDC Journal List 08022017
No ratings yet
ABDC Journal List 08022017
232 pages
Lab 01 - Scientific Method and Statistics (New Version)
0% (1)
Lab 01 - Scientific Method and Statistics (New Version)
25 pages
Handling Missing Data in R
No ratings yet
Handling Missing Data in R
30 pages
Mixed Method
100% (3)
Mixed Method
37 pages
Enhanced Electronic Vital Events Registration System For Ethiopia (EEVERSE)
No ratings yet
Enhanced Electronic Vital Events Registration System For Ethiopia (EEVERSE)
70 pages
SimProject Report
No ratings yet
SimProject Report
16 pages
Intro to Statistics for Students
No ratings yet
Intro to Statistics for Students
6 pages
Dynamic Modeling, Predictive Control and Performance Monitoring
No ratings yet
Dynamic Modeling, Predictive Control and Performance Monitoring
6 pages
Class: Ix Subject: Mathematics Assignment 12: Statistics
100% (1)
Class: Ix Subject: Mathematics Assignment 12: Statistics
2 pages
Manual de Psicopatologia, Vol I 3rd Edition Amparo Belloch Instant Download
100% (1)
Manual de Psicopatologia, Vol I 3rd Edition Amparo Belloch Instant Download
66 pages
Kruskal-Wallis Test: and It'S Implementation in R Programming
No ratings yet
Kruskal-Wallis Test: and It'S Implementation in R Programming
14 pages
SDCA Thesis Complete Guide
No ratings yet
SDCA Thesis Complete Guide
56 pages

GeostatsPy Spatial Data Declustering

Uploaded by

GeostatsPy Spatial Data Declustering

Uploaded by

Open Source Spatial Data Analytics in Python with GeostatsPy II

Spatial Uncertainty Modeling with GeostatsPy

• Spatial Data Declustering

• Interactive Demo with GeostatsPy

• Workflow with GeostatsPy

Michael Pyrcz, The University of Texas at Austin

Biased, naïve statistics from biased spatial data samples

Michael Pyrcz, The University of Texas at Austin

Michael Pyrcz, The University of Texas at Austin

• Spatial Data Declustering

Michael Pyrcz, The University of Texas at Austin

• how far does the contaminant plume extend? – sample peripheries

and to maximize value directly:

• maximize production rates

Michael Pyrcz, The University of Texas at Austin

• To assess the average porosity to calculate OIP

Michael Pyrcz, The University of Texas at Austin

Let’s make an estimate for an Area / Volume of Interest:

• What if we knew from seismic that the reservoir quality

Let’s make an estimate for an Area / Volume of Interest:

• What if we kept drilling in the high value region of the

How would our estimate of average porosity change as

Number of Wells Drilled

• The naïve sample average becomes more biased!

Here’s data and x-ray vision:

Samples and Exhaustive Truth Model Sparse Sample Distribution

Michael Pyrcz, The University of Texas at Austin

𝟏 weight x (289 data / 36 cells) = 1.09

Sum of all weights = n

All data in the same cell get the same weight.

3. > 1.0 increased weight

• Note: some software

Michael Pyrcz, The University of Texas at Austin

• Updated distribution with

• Now data file / table include values

• Possible to calculate any weighted

– For example, declustered mean:

σ𝑛𝑖 𝑤(𝐮𝑖 )𝑧(𝐮𝑖 )

• This sensitivity is removed by iterativing the mesh position,

Michael Pyrcz, The University of Texas at Austin

Michael Pyrcz, The University of Texas at Austin

• Interactive Demo with

• Explore the impact of

• Workflow with GeostatsPy

Michael Pyrcz, The University of Texas at Austin

Let’s walkthrough a more

• calculate data weights

• visualize and QC the results

Python Jupyter variogram calculation

Michael Pyrcz, The University of Texas at Austin

• Interactive Demo with GeostatsPy

• Workflow with GeostatsPy

Michael Pyrcz, The University of Texas at Austin

You might also like