CARVIEW

MOTORHOMES

Select Language

HTTP/2 200 server: nginx date: Mon, 13 Oct 2025 06:10:38 GMT content-type: text/html; charset=UTF-8 vary: Accept-Encoding x-cache-enabled: True content-security-policy: block-all-mixed-content x-httpd: 1 x-cdn-c: all content-encoding: gzip x-sg-cdn: 1 x-ce: asia-southeast1-6nd4 x-proxy-cache: MISS x-proxy-cache-info: 0 NC:000000 UP: host-header: 8441280b0c35cbc1147f8ba998a563a7 How to Use the Python statistics.variance() Function

How to Use the Python statistics.variance() Function

by Matthew Mayo Last updated on October 31, 2024

Variance is a statistical measure that represents the degree of spread in a set of data points, helping to quantify the extent to which individual data points differ from the dataset’s mean. To calculate a sample’s variance in python, the statistics module offers the statistics.variance() function. In this article, we will explore how to efficiently use this function, its syntax, and how it fits into broader statistical analysis.

Using the statistics.variance() Function

To use the the function, first import the statistics module.

import statistics

The syntax for the statistics.variance() function is straightforward:

statistics.variance(data, xbar=None)

data: This is a sequence of numeric values (like a list or tuple) for which to calculate the variance
xbar (optional): This represents the mean of the data; if not provided, the function calculates the mean internally

Let’s see how to calculate the sample variance using a simple dataset. Consider the following list of numbers:

import statistics

data = [10, 12, 23, 23, 16, 23, 21]
sample_variance = statistics.variance(data)
print(sample_variance)

# Output: 31.238095238095237

In this example, the function takes the dataset data, computes the mean, and then calculates the variance. When you run this code, it will output the calculated sample variance.

To pre-calculate and then pass the mean to the function, the code would be as follows:

import statistics

data = [10, 12, 23, 23, 16, 23, 21]
sample_mean = sum(data) / len(data)

sample_variance = statistics.variance(data, sample_mean)
print(sample_variance)

# Output: 31.238095238095237

When working with variance calculations, it’s good to know what potential errors may arise, such as providing insufficient data points or non-numeric inputs. The statistics.variance() function raises a StatisticsError in certain cases.

import statistics

try:
    # Invalid data: only one data point
    invalid_data = [5]
    statistics.variance(invalid_data)
except statistics.StatisticsError as e:
    print("Error:", e)
	
# Output: Error: variance requires at least two data points

In this code, attempting to calculate the variance of a single data point will raise an error, which we catch and print.

It’s worth nopting that the statistics module include 2 variance functions:

statistics.pvariance(): used when your dataset represents the entire population
statistics.variance() used when your dataset is a sample from that population

Wrapping Up

In summary, the statistics.variance() function in Python is a valuable tool for calculating the sample variance of a dataset. For more on this and other Python statistical functions, check out the statistics module’s official documentation.

Matthew Mayo

Matthew Mayo (@mattmayo13) holds a master’s degree in computer science and a graduate diploma in data mining. As managing editor of KDnuggets & Statology, and contributing editor at Machine Learning Mastery, Matthew aims to make complex data science concepts accessible. His professional interests include natural language processing, language models, machine learning algorithms, and exploring emerging AI. He is driven by a mission to democratize knowledge in the data science community. Matthew has been coding since he was 6 years old.

How to Use the Python statistics.variance() Function

Using the statistics.variance() Function

Wrapping Up

Leave a Reply Cancel reply

Join the Statology Community