Introduction To Clinical Data-1

materi

Uploaded by

Azka Salsabila


INTRO TO CLINICAL DATA STUDY GUIDE

MODULE 1 – ASKING AND ANSWERING QUESTIONS VIA CLINICAL DATA MINING

LEARNING OBJECTIVES

1. Explain the main steps in the data mining workflow

2. Describe the important categories of research questions
3. List properties that make a research question a useful one

THE DATA MINING WORKFLOW

The goal of this course is to explain how clinical data can be used to answer research questions
that improve the health of patients and populations.

What we'll cover

1. How to choose research questions that are important

2. Structure of the healthcare system to understand how (and what) patient data are generated
3. Different kinds of data
4. Overview of the processing and analysis methods that get us answers to our questions
5. Problems and biases that can arise as well as ways to manage them

Throughout this course we will refer to the data mining workflow, which has four steps:

1. Pose a research question.

2. Identify one or more data sources that can answer the question.

Copyright © Stanford University

3. Extract and transform the data into a form needed for the analysis.
4. Conduct the analysis using those data

After completing the steps, evaluate the results and repeat the process if necessary.

The two representations of healthcare data that we will focus on in this course are:

1. a patient timeline
2. a patient-feature matrix.
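
As a rough illustration of these two representations, the sketch below builds both from a handful of made-up events. It assumes that a patient timeline is each patient's events sorted by date, and that a patient-feature matrix has one row per patient and one 0/1 column per feature. The event tuples, feature names, and the functions build_timelines and build_feature_matrix are hypothetical and are not taken from the course materials.

```python
from collections import defaultdict
from datetime import date

# Hypothetical raw events: (patient_id, event_date, event_type, value).
events = [
    ("p1", date(2020, 3, 1), "diagnosis", "SLE"),
    ("p1", date(2020, 1, 5), "lab", "urine_protein_high"),
    ("p2", date(2019, 7, 9), "diagnosis", "SLE"),
    ("p1", date(2020, 6, 2), "note", "thrombosis"),
]


def build_timelines(events):
    """Group events by patient and sort each patient's events by date."""
    timelines = defaultdict(list)
    for patient_id, event_date, event_type, value in events:
        timelines[patient_id].append((event_date, event_type, value))
    for patient_events in timelines.values():
        patient_events.sort(key=lambda e: e[0])
    return dict(timelines)


def build_feature_matrix(timelines, features):
    """One row per patient; each column is 1 if the feature ever occurred."""
    matrix = {}
    for patient_id, patient_events in timelines.items():
        seen = {value for _, _, value in patient_events}
        matrix[patient_id] = [1 if f in seen else 0 for f in features]
    return matrix


timelines = build_timelines(events)
features = ["SLE", "urine_protein_high", "thrombosis"]
matrix = build_feature_matrix(timelines, features)
```

The timeline keeps the order of events, which matters when asking whether an outcome came after a condition. The matrix discards that order in exchange for a fixed-width table that is easier to analyze.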

REAL LIFE EXAMPLE

Laura

● A teenager with a chronic disease called Systemic Lupus Erythematosus (SLE).

● Has a flare up of the condition and develops proteinuria (protein in the urine), pancreatitis
(inflammation of the pancreas), and has antiphospholipid antibodies in her blood.

● She is at risk for developing a blood clot.

Step 1 in the data mining workflow:

● Our clinical question:

“Should a teenager with SLE who develops proteinuria and antiphospholipid antibodies
receive an anticoagulant medication?”

(X) Review medical literature

(X) Past experience

(X) Consult experts

• One approach is to examine what has happened to similar patients in the past, drawing on all
the relevant data that appears in the electronic medical record, or EMR, of a large academic
medical center.

We have now identified our data source, which completes Step 2 in the workflow.

EMR data are not necessarily organized in a way that makes searches straightforward. Medical
expertise is needed to choose the diagnosis codes and medical terms that can identify patients that
are in a similar situation.

The steps we would need to take:

1) Find all pediatric patients in the medical record system, using a query based on patient age.

2) Among them, find patients with a diagnosis code of SLE.

3) Find patients with proteinuria by checking the values listed in a urine test.

4) Find patients with antiphospholipid antibodies.

5) The laboratory test for these antibodies may be:

a) Recorded in numeric form

b) Recorded in a textual document; in that case, use treatment with aspirin as a proxy marker

c) Confirm the results of that search by checking laboratory results for those antibodies
in the patients whose results are in a searchable form

6) We are now in the midst of Step 3: we have defined criteria that allow us to identify the
appropriate group of similar patients.

7) The outcome of interest is clotting.

a) Search for "thrombus", "thrombosis", and "blood clot", again relying on clinical
expertise to choose those terms.
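
The selection steps above can be sketched as a set of small filters. This is only one possible reading, under stated assumptions: the record fields, the age limit of 18, the "M32" code prefix for SLE, and the urine protein cutoff are hypothetical placeholders rather than clinical standards or the query actually used. The sketch also trusts a searchable lab result when one exists and otherwise falls back to the aspirin proxy.

```python
import re

# Every field name, code, and cutoff below is a hypothetical placeholder,
# not a clinical standard and not the query used in the real example.
PEDIATRIC_MAX_AGE = 18
SLE_CODE_PREFIX = "M32"      # assumed diagnosis code prefix for SLE
PROTEINURIA_CUTOFF = 30      # illustrative urine protein cutoff only
CLOT_TERMS = re.compile(r"\b(thrombus|thrombosis|blood clot)\b", re.IGNORECASE)

patients = [
    {"id": "p1", "age": 15, "codes": ["M32.1"], "urine_protein": 120,
     "apl_lab_positive": None, "medications": ["aspirin"],
     "notes": "Ultrasound shows deep vein thrombosis."},
    {"id": "p2", "age": 16, "codes": ["M32.9"], "urine_protein": 5,
     "apl_lab_positive": False, "medications": [],
     "notes": "Routine follow-up, no complaints."},
    {"id": "p3", "age": 45, "codes": ["M32.1"], "urine_protein": 200,
     "apl_lab_positive": True, "medications": ["aspirin"],
     "notes": "History of blood clot."},
]


def is_pediatric_sle(p):
    """Steps 1 and 2: age query plus SLE diagnosis code."""
    has_sle = any(code.startswith(SLE_CODE_PREFIX) for code in p["codes"])
    return p["age"] < PEDIATRIC_MAX_AGE and has_sle


def has_proteinuria(p):
    """Step 3: check the numeric urine test value."""
    return p["urine_protein"] >= PROTEINURIA_CUTOFF


def has_antiphospholipid_antibodies(p):
    """Steps 4 and 5: use the searchable lab result when available,
    otherwise use aspirin treatment as a proxy marker."""
    if p["apl_lab_positive"] is not None:
        return p["apl_lab_positive"]
    return "aspirin" in p["medications"]


def had_clot(p):
    """Step 7: text search of clinical notes for the chosen clotting terms."""
    return CLOT_TERMS.search(p["notes"]) is not None


cohort = [p for p in patients if is_pediatric_sle(p)]
exposed = [p for p in cohort
           if has_proteinuria(p) and has_antiphospholipid_antibodies(p)]
```

A plain term search like had_clot would also match negated mentions such as "no thrombosis", which is one reason the text stresses clinical expertise and human review when crafting these searches.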

Analysis is straightforward: compare the risk of clotting in teenagers with SLE, proteinuria, and
antiphospholipid antibodies, to the baseline risk of clotting in teenagers with SLE.

• We first need to get the patient data out of the EMR system in a form that allows analysis.
• The analysis found that the risk of a blood clot when proteinuria and antiphospholipid
antibodies are present is twice the baseline risk (a relative risk of 2), which led to the
choice to treat with an anticoagulant.

We have now completed Step 4 of the “data mining workflow”.

Consider this information in the form of a patient timeline.

• Start with data collected by the healthcare system, in this case, the electronic medical record,
which includes diagnosis codes, lab results, medication orders, and text notes written by
clinicians.

• Arrange these data on a timeline, one for each patient and then identify pediatric patients,
and flag those with SLE, some of whom develop the comorbidities of concern (proteinuria,
antiphospholipid antibodies).
• Use the timeline to count which patients experienced the outcome of clotting after they
developed each clinical condition. Then compute the fraction of those with each condition
who developed blood clots to arrive at the relative risk.
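
The last bullet can be written out as a short calculation. The following is a minimal sketch, assuming the counts have already been read off the patient timelines. The function names risk and relative_risk are hypothetical, and the counts are made up purely for illustration; they are not the numbers from Laura's case.

```python
def risk(n_with_outcome, n_total):
    """Fraction of a group that experienced the outcome."""
    if n_total == 0:
        raise ValueError("group is empty; risk is undefined")
    return n_with_outcome / n_total


def relative_risk(exposed_events, exposed_total, baseline_events, baseline_total):
    """Risk in the group with the conditions divided by the baseline risk."""
    baseline = risk(baseline_events, baseline_total)
    if baseline == 0:
        raise ValueError("baseline risk is zero; relative risk is undefined")
    return risk(exposed_events, exposed_total) / baseline


# Made-up counts: 4 of 20 patients with both conditions had a clot,
# compared with 10 of 100 teenagers with SLE overall.
rr = relative_risk(4, 20, 10, 100)
```

A relative risk above 1 means clotting was more common in the group with the conditions than at baseline. With small groups the estimate is unstable, so the counts themselves should be reported alongside the ratio.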

Revisit the data mining workflow steps.

1. What was the clinical question? Is the risk of clotting in a teenager with SLE, proteinuria,
and antiphospholipid antibodies high enough to warrant treatment with an anticoagulant?

2. What is the data source? The electronic medical record.

3. What extract/transform steps did we take? We defined how we will find teenagers with SLE,
and how we will define subgroups based on clinical conditions. This involved the use of
diagnosis codes in some cases and the use of medical expertise to craft searches for other
terms. In one case we used a proxy term in the search, followed by a second, confirmatory
step.

4. Finally, we compared the risk of clotting in the subgroups to the risk of clotting for teenagers
with SLE in order to guide our decision to treat.

In this example we primarily used one data source, the EMR. Remember that there are no “perfect”
ways of doing all the steps we reviewed in the example and it is best to think of the entire data
mining process as something that should be done with an expert human in the loop rather than by
an automated algorithm that provides answers without knowing the larger context of the situation.

TYPES OF RESEARCH QUESTIONS

• A descriptive question asks for a summary of the data

• An exploratory question attempts to find what patterns might exist in the dataset available.
• An inferential question looks for patterns that go beyond just the particular dataset
available. The goal is to find generalizable knowledge
• A predictive question looks for quantitative relationships between some features and the
outcome of interest.
• A causal question looks for the effect of changes in one variable on a second variable.
• A deterministic question directly addresses the underlying mechanism.

Clinical data are best suited for answering descriptive, exploratory, inferential, and predictive
questions.

We ask these questions to accomplish two primary goals:

1. Risk stratification to decide whether to treat

2. Data-driven selection of how to treat

What do you think about our analysis for Laura?

The question we asked was about treating Laura who was experiencing a set of clinical
conditions. However, the question we answered was about the proportion of patients with a set
of clinical conditions who developed a blood clot. What we answered was a descriptive question
relying on counts and proportions.

We then need to make an assumption that what happened in the past to those patients is likely
to happen to Laura as well. The assumption, and the resulting conclusion, provides us with a
‘risk-stratification’.

If we conclude that Laura is at high risk, what treatment to offer is clear, which is to use
anticoagulation. In real life we would also need to draw similar conclusions about the risks of
adverse outcomes resulting from the treatment itself before making a final decision.

What makes answering a question useful:

• How many lives are affected? What is the disease burden?

• What is the chance that results will have a beneficial effect on the target community?
• What happens as a result of answering the question?
• Does knowing the answer help more than one constituent group among patients, healthcare
professionals, and payers of care?
