HOME
ABOUT
- RESULTS
- differences
- BENEFITS
- HISTORY
- TEAM
- LOCATION
- FACILITIES
- BANKING
- MEMBERSHIPS
- APPROVALS
- LICENCES
- SUPPLIERS
- SPONSORSHIPS
- MEDIA
- PRIVACY
AUCTIONS
SHIPPING
FEES
- TS REWARDS
TOOLS
guides
FAQ
CONTACT
- CONNECT

VEHICLES
BRAND
- JAPANESE CARS
  - DAIHATSU
  - EUNOS
  - FORD
  - HONDA
  - ISUZU
  - LEXUS
  - MAZDA
  - MITSUBISHI
  - MITSUOKA
  - NISSAN
  - SUBARU
  - SUZUKI
  - TOYOTA
- GERMAN CARS
- AMERICAN CARS
- BRITISH CARS
- ITALIAN CARS
- FRENCH CARS
- SWEDISH CARS
- KOREAN CARS
TYPE
- mobility
- VENDING
- instruction
- TAXIS
- AMBULANCES
- FIRE ENGINES
- HEARSES
- LIMOUSINES
- COMMERCIAL
CLASS
FUEL
TRUCKS
minitrucks
- DAIHATSU
- HONDA
- MAZDA
- MITSUBISHI
- NISSAN
- SUBARU
- SUZUKI
- DUMP
- CRANE
- CAMPER
- REFRIGERATED
- 4WD
- NEW
BUSES
MOTORHOMES
- YAHOO!
- RAKUTEN
- DEALER

PARTS
- FREE REPORT
- PARTS CONTAINERS
- PARTS SYSTEMS
- PARTS PROTECTION
- BODY SHELLS
- DISMANTLING
- ONLINE PARTS
- NEW PARTS
- INTERIOR PARTS
- EXTERIOR PARTS
  - BONNETS
  - BUMPERS
  - GRILLES
  - FENDERS
  - DOORS
  - TRUNKS
  - SPOILERS
  - LIGHTS
  - EMBLEMS
  - CAMERAS
- ENGINES
- TRANSMISSIONS
- WHEELS & TYRES
  - WHEELS
  - TYRES
CUTS
PERFORMANCE PARTS
TRUCK PARTS
MOTORBIKE PARTS
- MOTORBIKE ENGINES
- MOTORBIKE ACCESSORIES

MOTORBIKES
MARINE
FORKLIFTS
MACHINERY
AGRICULTURAL
OTHER
COUNTRY
- AUSTRALIA
- CANADA
- KENYA
- MYANMAR
- NEW ZEALAND
- PAKISTAN
- TANZANIA
- UNITED STATES

CARVIEW

MOTORHOMES

Select Language

HTTP/2 200 content-type: text/html; charset="UTF-8" date: Thu, 31 Jul 2025 23:20:32 GMT server: Apache strict-transport-security: max-age=31536000; includeSubDomains cache-control: s-maxage=128076, max-age=3, stale-while-revalidate=0, stale-if-error=0 set-cookie: 1368813=4265%2C9172; expires=Thu, 31-Jul-2025 23:22:12 GMT; Max-Age=100; path=/ content-encoding: gzip access-control-allow-credentials: true x-frame-options: DENY x-content-type-options: nosniff vary: Accept-Encoding,Cookie x-cache: Miss from cloudfront via: 1.1 3e4af6ffbc2fb603daf8897afc5cc7f6.cloudfront.net (CloudFront) x-amz-cf-pop: BOM78-P9 x-amz-cf-id: J_PJ32Hz2IcyeqwHOaNp6N09UPmcrBySqf6HTPZgqnXclLnlm1gmCA== Pairwise Deletion - GeeksforGeeks

Courses
Tutorials
Practice
Jobs

Notifications

Mark all as read

All

View All

Notifications

Mark all as read

All

Unread

Read

You're all caught up!!

Data Science IBM Certification
Data Science
Data Science Projects
Data Analysis
Data Visualization
Machine Learning
ML Projects
Deep Learning
NLP
Computer Vision
Artificial Intelligence

Open In App

Explore GfG Courses

Share Your Experiences

Pairwise vs Listwise Deletion Deletion in B+ Tree PL/SQL DELETE JOIN SQL DELETE JOIN MySQL DELETE JOIN delete keyword in C++

DSA to Development Course

Pairwise Deletion

Last Updated : 23 Jul, 2025

Comments

Improve

Suggest changes

Like Article

Report

Pairwise Deletion is one method used to handle missing data, especially when estimating correlations or covariances.

Missing Data

Missing data can occur due to various reasons, such as sensor failures, manual entry errors or respondents skipping survey questions. Missing data mechanisms are categorized as:

MCAR (Missing Completely at Random): The probability of missingness is independent of observed or unobserved data.
MAR (Missing at Random): The probability of missingness depends on observed data but not unobserved data.
MNAR (Missing Not at Random): The probability of missingness depends on unobserved data.

Challenges Posed by Missing Data

Bias in Estimates: Ignoring missing data can introduce bias in statistical estimates.
Reduced Efficiency: Fewer data points result in less information for model training, impacting accuracy.
Complexity in Models: Advanced imputation methods may increase computational costs.

Pairwise Deletion

Pairwise deletion is a technique that evaluates each pair of variables based only on cases (rows) that have non-missing values for that pair. In simpler terms, it computes statistics such as correlations or covariances using the maximum number of available data points for each pair.

This approach contrasts with listwise deletion, which removes entire rows from the dataset if any value is missing.

To understand pairwise deletion, let us consider an example of calculating the correlation coefficient 𝑟 between two variables 𝑋 and 𝑌.

r = \frac{\sum_{i=1}^{n}(x_i - \bar{X})(y_i - \bar{Y})}{\sqrt{\sum_{i=1}^{n}(x_i - \bar{X})^2 \cdot \sum_{i=1}^{n}(y_i - \bar{Y})^2}}

Where:

n: Number of observations (data points) in the dataset.
x_i, y_i: Individual values of variables X and Y respectively.
\bar{X}, \bar{Y}: Mean (average) of X and Y

Pairwise Deletion in Machine Learning

Feature Selection: When computing feature correlations to identify redundant features.
Covariance Estimation: For multivariate normal data, pairwise deletion can estimate covariance matrices.
Exploratory Data Analysis: Visualizing relationships between features while tolerating missing data.

Python

import numpy as np
import pandas as pd
# Sample dataset with missing values
data = {
    'X': [1, 2, np.nan, 4, 5],
    'Y': [5, np.nan, 2, 4, 3],
    'Z': [2, 3, 4, 5, np.nan]
}
df = pd.DataFrame(data)
def pairwise_correlation(df):
    """Compute pairwise correlations."""
    correlations = {}
    for col1 in df.columns:
        for col2 in df.columns:
            if col1 != col2:
                # Drop rows where either value is missing
                valid_data = df[[col1, col2]].dropna()
                if not valid_data.empty:
                    corr = valid_data.corr().iloc[0, 1]
                    correlations[f'{col1}-{col2}'] = corr
    return correlations
correlations = pairwise_correlation(df)
print("Pairwise Correlations:")
print(correlations)

Output

Pairwise Correlations:
{'X-Y': np.float64(-0.9607689228305226), 'X-Z': np.float64(1.0), 'Y-X': np.float64(-0.9607689228305226), 'Y-Z': np.float64(-0.4999999999999999), 'Z-X': np.float64(1.0), 'Z-Y': n...

Pairwise Deletion in Deep Learning

In deep learning, handling missing data typically involves imputation techniques rather than deletion. However, pairwise deletion might still be used during preprocessing for correlation analysis.

Visualizing Pairwise Correlations with a Heatmap

Python

import pandas as pd
import seaborn as sns
import matplotlib.pyplot as plt
# Step 1: Create a sample DataFrame (Replace this with your actual dataset)
data = {
    "A": [1, 2, 3, 4, 5],
    "B": [2, 3, 4, 5, 6],
    "C": [5, 4, 3, 2, 1],
    "D": [3, 3, 3, 3, 3]
}
df = pd.DataFrame(data)  # Define the DataFrame
# Step 2: Compute the pairwise correlation matrix
pairwise_corr = df.corr(min_periods=1)  # Ensure there are enough non-NaN values
# Step 3: Create a heatmap using Seaborn
sns.heatmap(pairwise_corr, annot=True, cmap="coolwarm")
# Step 4: Customize and display the plot
plt.title("Pairwise Correlation Heatmap")
plt.show()

Output:

correlation-heatmap — Pairwise Correlation Heatmap

Alternatives to Pairwise Deletion

While pairwise deletion has its merits, it is not always the best method. Here are some alternatives:

Listwise Deletion: Removes rows with any missing values.
Mean/Median Imputation: Replaces missing values with the mean or median of the feature.
Multiple Imputation: Generates multiple plausible datasets and averages results.
Machine Learning Imputation: Uses models like k-Nearest Neighbors or MICE (Multiple Imputation by Chained Equations).
Matrix Factorization: Fills missing values by leveraging low-rank structures in the data.

Advantages of Pairwise Deletion

Maximizes Data Usage: Retains more data compared to listwise deletion.
Simple to Implement: Pairwise deletion is computationally straightforward for small datasets.
Preserves Relationships: Useful for specific analyses like correlation estimation.

Disadvantages of Pairwise Deletion

Inconsistent Sample Sizes: Each pair of variables may use a different subset of data, leading to challenges in comparing results.
Potential Bias: If the data is not MCAR, pairwise deletion may introduce bias.
Not Suitable for Complex Models: For regression or machine learning models, inconsistent sample sizes can complicate training.

Pairwise vs Listwise Deletion

Bhumi Mittal

Improve

Article Tags :

R Language
AI-ML-DS

2k+ interested Geeks

GATE CSE 2028 [Semester & Placement Preparation]

265k+ interested Geeks

Master Competitive Programming - Complete Beginner to Advanced

11k+ interested Geeks

GATE CS/IT 2026 Complete Course [with Placement Preparation]

Corporate & Communications Address:

A-143, 7th Floor, Sovereign Corporate Tower, Sector- 136, Noida, Uttar Pradesh (201305)

Registered Address:

K 061, Tower K, Gulshan Vivante Apartment, Sector 137, Noida, Gautam Buddh Nagar, Uttar Pradesh, 201305

Advertise with us

Company
About Us
Legal
Privacy Policy
Careers
In Media
Contact Us
Corporate Solution
Campus Training Program

Explore
Job-A-Thon
Offline Classroom Program
DSA in JAVA/C++
Master System Design
Master CP
Videos

Tutorials
Python
Java
C++
PHP
GoLang
SQL
R Language
Android

DSA
DSA Tutorial
Problem Of The Day
GfG 160
DSA 360
DSA Roadmap
DSA Interview Questions
Competitive Programming

Data Science & ML
Data Science With Python
Machine Learning
ML Maths
Data Visualisation
Pandas
NumPy
NLP
Deep Learning

Web Technologies
HTML
CSS
JavaScript
TypeScript
ReactJS
NextJS
NodeJs
Bootstrap
Tailwind CSS

Python Tutorial
Python Examples
Django Tutorial
Python Projects
Python Tkinter
Web Scraping
OpenCV Tutorial
Python Interview Question

Computer Science
GATE CS Notes
Operating Systems
Computer Network
Database Management System
Software Engineering
Digital Logic Design
Engineering Maths

DevOps
Git
AWS
Docker
Kubernetes
Azure
GCP
DevOps Roadmap

System Design
High Level Design
Low Level Design
UML Diagrams
Interview Guide
Design Patterns
OOAD
System Design Bootcamp
Interview Questions

School Subjects
Mathematics
Physics
Chemistry
Biology
Social Science
English Grammar

Databases
SQL
MYSQL
PostgreSQL
PL/SQL
MongoDB

Preparation Corner
Company-Wise Recruitment Process
Aptitude Preparation
Puzzles
Company-Wise Preparation

More Tutorials
Software Development
Software Testing
Product Management
Project Management
Linux
Excel
All Cheat Sheets

Courses
IBM Certification Courses
DSA and Placements
Web Development
Data Science
Programming Languages
DevOps & Cloud

Programming Languages
C Programming with Data Structures
C++ Programming Course
Java Programming Course
Python Full Course

Clouds/Devops
DevOps Engineering
AWS Solutions Architect Certification
Salesforce Certified Administrator Course

GATE 2026
GATE CS Rank Booster
GATE DA Rank Booster
GATE CS & IT Course - 2026
GATE DA Course 2026
GATE Rank Predictor

We use cookies to ensure you have the best browsing experience on our website. By using our site, you acknowledge that you have read and understood our Cookie Policy & Privacy Policy