Handling of Missing Values

The document describes a 5-phase algorithm for data imputation, focusing on handling missing values represented by '?'. It involves scanning through rows to find similar rows based on matching column values, and then replacing missing values with the majority value from similar rows or the overall dataset. The algorithm emphasizes the importance of counting occurrences to determine the most frequent value for imputation.

Uploaded by

ambuj.kumar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

28 views1 page

Handling of Missing Values

Uploaded by

ambuj.kumar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 1

It is a N-phase algorithm (N = 5 is usually sufficient, but can be increased if needed).

Each phase is made of following steps.

Scan through all rows.

for(i = 0 to numberOfRows) do
{
for(j = 0 to numberOfColumns) do {
if(data[i][j] is ‘?’) then {
Step 1: Find similar rows
- Scan through entire dataset rows EXCEPT row number ‘i’.
- If at least “half” of column values (which are NOT ?) match with this row and
column number ‘j’ is NOT ?, then include that row as a similarRow.

Step 2 (a) : If similarRows is an empty set, then

Scan through all rows except row number ‘i’. Look at value of column ‘j’. If non-
empty, then keep a count of that value for column[j].
After all rows have been scanned, then choose the value whose count is
maximum.
Replace data[i][j] with that majority value.

Step 2 (b) : If similarRows is non-empty set, then

Scan through all rows in similarRow and keep track of values in column[j].
Choose the value which occurs maximum number of times and replace data[i][j]
with that majority value.
}
}
}

Gaussian Elimination Spreadsheet
0% (1)
Gaussian Elimination Spreadsheet
2 pages
Gaussian Elimination With Partial Pivoting
No ratings yet
Gaussian Elimination With Partial Pivoting
4 pages
Gaussian Elimination Tutorial
No ratings yet
Gaussian Elimination Tutorial
2 pages
Linear Equations for Mathematicians
No ratings yet
Linear Equations for Mathematicians
6 pages
Ramanujan Type 1 Pi Approximation Formul
No ratings yet
Ramanujan Type 1 Pi Approximation Formul
15 pages
Ashok Singh v. State of Uttar Pradesh & Anr. R1: State of Uttar Pradesh R2: Ravindra Pratap Singh
No ratings yet
Ashok Singh v. State of Uttar Pradesh & Anr. R1: State of Uttar Pradesh R2: Ravindra Pratap Singh
17 pages
Lapping Machine
No ratings yet
Lapping Machine
2 pages
Thông Kê
No ratings yet
Thông Kê
9 pages
Rakesh Bhanot v. M/s. Gurdas Agro Pvt. LTD.: (2025) 4 S.C.R. 573: 2025 INSC 445
No ratings yet
Rakesh Bhanot v. M/s. Gurdas Agro Pvt. LTD.: (2025) 4 S.C.R. 573: 2025 INSC 445
30 pages
Omca 6
No ratings yet
Omca 6
1 page
Lab 5
No ratings yet
Lab 5
6 pages
Lecture 2 Venn Diagrams
No ratings yet
Lecture 2 Venn Diagrams
6 pages
Introduction To Programming With Matlab: Exercises
100% (1)
Introduction To Programming With Matlab: Exercises
18 pages
Computation 09 00029 v2
No ratings yet
Computation 09 00029 v2
49 pages
PERT CPM SCurve Presentable
No ratings yet
PERT CPM SCurve Presentable
11 pages
Gauss Total
No ratings yet
Gauss Total
2 pages
ML Lab Programs
No ratings yet
ML Lab Programs
16 pages
Kishore Chhabra v. The State of Haryana & Ors.: (2025) 4 S.C.R. 327: 2025 INSC 419
No ratings yet
Kishore Chhabra v. The State of Haryana & Ors.: (2025) 4 S.C.R. 327: 2025 INSC 419
9 pages
Quiz - 2 - Mock Paper - Unanswere
No ratings yet
Quiz - 2 - Mock Paper - Unanswere
22 pages
The Product Is Irrational: N. A. Carella
No ratings yet
The Product Is Irrational: N. A. Carella
10 pages
Numec Assignment 1 - 2
No ratings yet
Numec Assignment 1 - 2
2 pages
11 Assignment Problem
No ratings yet
11 Assignment Problem
61 pages
Sorting
No ratings yet
Sorting
1 page
Gaussian Elimination for Engineers
No ratings yet
Gaussian Elimination for Engineers
3 pages
Gauss Elimination With Partial Pivoting
No ratings yet
Gauss Elimination With Partial Pivoting
2 pages
First Completely Painted Row or Column - Editorial
No ratings yet
First Completely Painted Row or Column - Editorial
11 pages
Code Vba
No ratings yet
Code Vba
2 pages
Definition. A Matrix Is Upper-Triangular If - 0 0 0 - 0 0 - . - . 0 - . - .
No ratings yet
Definition. A Matrix Is Upper-Triangular If - 0 0 0 - 0 0 - . - . 0 - . - .
2 pages
Import CSV Qu Cs
No ratings yet
Import CSV Qu Cs
2 pages
Practice Problem 3-1 (Solution)
No ratings yet
Practice Problem 3-1 (Solution)
4 pages
Math Constants: Approximations Explored
No ratings yet
Math Constants: Approximations Explored
11 pages
Matrices
No ratings yet
Matrices
6 pages
NC-Assignment No 1
No ratings yet
NC-Assignment No 1
4 pages
Gauss
No ratings yet
Gauss
2 pages
Module V
No ratings yet
Module V
48 pages
CH 2 Solutions To Linear Equations
No ratings yet
CH 2 Solutions To Linear Equations
125 pages
15 Mark Question 2
No ratings yet
15 Mark Question 2
4 pages
ML 2nd PRG
No ratings yet
ML 2nd PRG
4 pages
A Reconstruction Assessment Error Analys
No ratings yet
A Reconstruction Assessment Error Analys
17 pages
W10PA Sep2022
No ratings yet
W10PA Sep2022
18 pages
SPQTLabManual (27 01 2025)
No ratings yet
SPQTLabManual (27 01 2025)
7 pages
Aa 6344
No ratings yet
Aa 6344
15 pages
IV - ML Lab
No ratings yet
IV - ML Lab
31 pages
Explanation DuplicateRemovalFormula
No ratings yet
Explanation DuplicateRemovalFormula
4 pages
1746330870
No ratings yet
1746330870
26 pages
CT Week 8 GA
No ratings yet
CT Week 8 GA
23 pages
Matrix Inversion
No ratings yet
Matrix Inversion
4 pages
Assignmnet Problem Balanced With Solver
No ratings yet
Assignmnet Problem Balanced With Solver
32 pages
Métodos Numéricos: Guía y Algoritmos
No ratings yet
Métodos Numéricos: Guía y Algoritmos
6 pages
Gaussian Elimination Matlab
No ratings yet
Gaussian Elimination Matlab
3 pages
ML1 3 Merged
No ratings yet
ML1 3 Merged
19 pages
2023MT13122 Maths Assignment 1
No ratings yet
2023MT13122 Maths Assignment 1
22 pages
Dishes Review
No ratings yet
Dishes Review
43 pages
Mathematics 08 02204 v2
No ratings yet
Mathematics 08 02204 v2
23 pages
ML Lab Manual-99
No ratings yet
ML Lab Manual-99
23 pages
Final Exam
No ratings yet
Final Exam
28 pages
AOP Unit 3 Chapter 2
No ratings yet
AOP Unit 3 Chapter 2
58 pages
Kmapwordr
No ratings yet
Kmapwordr
25 pages
Here Is A Pascal Program To Solve Small Problems Using The Simplex Algorithm
No ratings yet
Here Is A Pascal Program To Solve Small Problems Using The Simplex Algorithm
12 pages
ML Lab Programs
No ratings yet
ML Lab Programs
21 pages
Identifythecorn!: - FG@ - F - G - Fotf
No ratings yet
Identifythecorn!: - FG@ - F - G - Fotf
10 pages
Excel Notes 865f76eb d2fd 425b 8c53 Bb4c3a7eaa7b
No ratings yet
Excel Notes 865f76eb d2fd 425b 8c53 Bb4c3a7eaa7b
20 pages
Gauss Elimination for Algebra Students
No ratings yet
Gauss Elimination for Algebra Students
11 pages
AIML LAB Final
No ratings yet
AIML LAB Final
13 pages
Eisenstein Series and Approximations To: Bruce C. Berndt and Heng Huat Chan
No ratings yet
Eisenstein Series and Approximations To: Bruce C. Berndt and Heng Huat Chan
16 pages
MITOCW - Ocw-18.06-F99-Lec08 - 300k
No ratings yet
MITOCW - Ocw-18.06-F99-Lec08 - 300k
20 pages
Dictionary
No ratings yet
Dictionary
68 pages
VBA2017 (Student)
No ratings yet
VBA2017 (Student)
19 pages
Lecture - Linear - Systems PDF
No ratings yet
Lecture - Linear - Systems PDF
31 pages
Simplex Method (Big M) : Eng. Shimaa Abouelenein
No ratings yet
Simplex Method (Big M) : Eng. Shimaa Abouelenein
28 pages
11 Geometrical Method of Determination of the Value of Pi (π)
No ratings yet
11 Geometrical Method of Determination of the Value of Pi (π)
12 pages
Quantitative Techniques-Ii Problem With Maximisation & Unbalanced
No ratings yet
Quantitative Techniques-Ii Problem With Maximisation & Unbalanced
20 pages
1 s2.0 S0898122108004306 Main
No ratings yet
1 s2.0 S0898122108004306 Main
7 pages
An Investigation Into Computing The Digi
No ratings yet
An Investigation Into Computing The Digi
7 pages
Research Pythagoras 1
No ratings yet
Research Pythagoras 1
2 pages
Research Pythagoras 2
No ratings yet
Research Pythagoras 2
2 pages
Figure 2
No ratings yet
Figure 2
1 page

Handling of Missing Values

Uploaded by

Handling of Missing Values

Uploaded by

It is a N-phase algorithm (N = 5 is usually sufficient, but can be increased if needed).

Each phase is made of following steps.

Scan through all rows.

Step 2 (a) : If similarRows is an empty set, then

Step 2 (b) : If similarRows is non-empty set, then

You might also like