What Is Pairwise Testing? Drastically Reduce Test Cases with allpairspy × pytest

Pairwise testing is a test design technique that significantly reduces the number of test cases while guaranteeing that every pair of parameter values is tested at least once. It’s the practical solution to combinatorial explosion when testing across OS, browser, language, and other configuration variables.

💡 In one sentence
Pairwise testing = “Finding bugs efficiently by covering every parameter pair — without testing every combination”

Pairwise testing frees you from the pressure of “we have to test everything” — achieving high bug detection rates with far fewer test cases. Research shows that the vast majority of bugs are caused by the interaction of just two parameters.

📌 Who This Article Is For

QA engineers and developers learning pairwise testing for the first time
Anyone whose combination test case count has grown unmanageable
Those who want to optimize OS / browser / configuration combination testing
Engineers looking to handle scenarios where decision tables have too many conditions

✅ What You’ll Learn

How pairwise testing works and how it reduces test case count
How to auto-generate test cases in Python using allpairspy
How to combine generated cases with pytest’s parametrize for automated testing

👤 About the Author

Written by Yoshi, a QA and test automation engineer with 15+ years of hands-on experience. Pairwise testing is a technique actively used in cross-browser testing and systems with many configuration parameters. Code is publicly available on GitHub: github.com/YOSHITSUGU728/automated-testing-portfolio

“3 OS × 4 browsers × 2 language settings = 24 combinations” — does your test count keep multiplying every time you add a configuration option?

Every new condition causes the full combination count to explode — testing can’t keep up
There’s no time or resources to run every possible combination
But cutting too many tests feels risky — what if something gets missed?

Pairwise testing solves these problems. This article covers everything from the concept, to generating test cases automatically, to applying them in pytest.

📌 Key Takeaways

A test design technique that guarantees every pair of parameter values is tested at least once
3 parameters × 3 values = 27 combinations reduced to around 9–12 test cases
Auto-generate cases with Python’s allpairspy and pass them to parametrize for instant automated tests

What Is Pairwise Testing?
How Does Pairwise Testing Work?
1. Example: Cross-Environment Web App Testing
2. 9 Cases Generated by Pairwise
What’s the Difference Between Pairwise Testing and Decision Tables?
Auto-Generating Test Cases with allpairspy
Combining with pytest parametrize
1. Sample Execution Output
Where Pairwise Testing Is Most Effective
⚠️ 4 Common Pitfalls in Pairwise Testing
FAQ
📋 Summary

What Is Pairwise Testing?

Pairwise testing is a test design technique that generates the minimum set of test cases that guarantees every possible pair of parameter values is covered at least once. It’s also called “all-pairs testing” or “2-way testing.”

Why is covering “pairs” sufficient? Research has shown that the vast majority of bugs are caused by interactions between just 1 or 2 parameters. Bugs that only surface when 3 or more parameters all have specific values simultaneously are extremely rare.

Approach	3 parameters × 3 values	Characteristics
Full combination	27 cases	Complete coverage · not realistic
Pairwise testing	~9–12 cases	Pair coverage · high bug detection rate
Random sampling	Arbitrary	No reproducibility · no coverage guarantee

How Does Pairwise Testing Work?

Here’s a concrete example of how pairwise testing reduces test cases.

Example: Cross-Environment Web App Testing

Imagine testing a system with these three parameters.

📋 Test Parameters

Parameter	Value ①	Value ②	Value ③
OS	Windows	macOS	Linux
Browser	Chrome	Firefox	Safari
Language	English	Japanese	Korean

Full combination: 3 × 3 × 3 = 27 cases → Pairwise: reduced to ~9 cases

9 Cases Generated by Pairwise

#	OS	Browser	Language
1	Windows	Chrome	English
2	Windows	Firefox	Japanese
3	Windows	Safari	Korean
4	macOS	Chrome	Japanese
5	macOS	Firefox	Korean
6	macOS	Safari	English
7	Linux	Chrome	Korean
8	Linux	Firefox	English
9	Linux	Safari	Japanese

These 9 cases cover every pair: OS × Browser, OS × Language, and Browser × Language — all at least once. That’s a 67% reduction from 27 cases, while maintaining complete 2-way coverage.

💡 Key Point: For example, the “macOS × Firefox” pair is covered by case 5, and “Firefox × Korean” is also covered by case 5. Pairwise arranges test cases so every pair appears at least once with maximum efficiency.

What’s the Difference Between Pairwise Testing and Decision Tables?

Both are “combination testing” techniques, but pairwise testing and decision tables have clearly different use cases.

Aspect	Decision Table	Pairwise Testing
Number of conditions	2–5 conditions	Handles 5+ conditions
Coverage	All combinations	All 2-parameter pairs
Best suited for	Business logic, permission management	Environment config, parameter combinations
Parameter dependencies	Dependencies can be explicitly modeled	Best for independent parameters

💡 Decision Guide

2–5 conditions, business logic verification → Decision table
5+ parameters, environment or config combinations → Pairwise testing
Strong dependencies between parameters → Decision table is safer

Auto-Generating Test Cases with allpairspy

Designing pairwise test cases by hand is difficult, so in production the standard approach is to auto-generate them with a tool like allpairspy for Python.

Installation

pip install allpairspy

Generating Test Cases Automatically

from allpairspy import AllPairs

# Define parameters
parameters = [
    ["Windows", "macOS", "Linux"],          # OS
    ["Chrome", "Firefox", "Safari"],         # Browser
    ["English", "Japanese", "Korean"],       # Language
]

# Auto-generate pairwise test cases
test_cases = [case.test_parameters for case in AllPairs(parameters)]

for i, case in enumerate(test_cases, 1):
    print(f"Case {i}: OS={case[0]}, Browser={case[1]}, Language={case[2]}")

Sample Output

Case 1: OS=Windows, Browser=Chrome, Language=English
Case 2: OS=Windows, Browser=Firefox, Language=Japanese
Case 3: OS=Windows, Browser=Safari, Language=Korean
Case 4: OS=macOS, Browser=Chrome, Language=Japanese
Case 5: OS=macOS, Browser=Firefox, Language=Korean
Case 6: OS=macOS, Browser=Safari, Language=English
Case 7: OS=Linux, Browser=Chrome, Language=Korean
Case 8: OS=Linux, Browser=Firefox, Language=English
Case 9: OS=Linux, Browser=Safari, Language=Japanese
# 27 combinations → reduced to 9 cases!

💡 Pro Tip: allpairspy handles the optimal calculation automatically no matter how many parameter values you add. 4 parameters × 3 values = 81 combinations can be reduced to just 12–15 cases.

Combining with pytest parametrize

Pass the generated test cases directly to pytest’s @pytest.mark.parametrize to convert them into automated tests instantly.

import pytest
from allpairspy import AllPairs

# Define parameters
parameters = [
    ["Windows", "macOS", "Linux"],
    ["Chrome", "Firefox", "Safari"],
    ["English", "Japanese", "Korean"],
]

# Generate pairwise test cases
pairwise_cases = [case.test_parameters for case in AllPairs(parameters)]

# Pass directly to parametrize
@pytest.mark.parametrize("os, browser, language", pairwise_cases)
def test_web_app_cross_env(os, browser, language):
    """Pairwise test: verify web app works correctly across environments"""
    print(f"Environment: {os} / {browser} / {language}")
    # Add actual test logic here
    # e.g. launch browser with Playwright and verify
    assert True  # replace with real assertions

Sample Execution Output

$ pytest test_pairwise.py -v

collected 9 items

test_pairwise.py::test_web_app_cross_env[Windows-Chrome-English]   PASSED
test_pairwise.py::test_web_app_cross_env[Windows-Firefox-Japanese] PASSED
test_pairwise.py::test_web_app_cross_env[Windows-Safari-Korean]    PASSED
test_pairwise.py::test_web_app_cross_env[macOS-Chrome-Japanese]    PASSED
test_pairwise.py::test_web_app_cross_env[macOS-Firefox-Korean]     PASSED
test_pairwise.py::test_web_app_cross_env[macOS-Safari-English]     PASSED
test_pairwise.py::test_web_app_cross_env[Linux-Chrome-Korean]      PASSED
test_pairwise.py::test_web_app_cross_env[Linux-Firefox-English]    PASSED
test_pairwise.py::test_web_app_cross_env[Linux-Safari-Japanese]    PASSED

========================== 9 passed in 0.08s ===========================

💡 Pro Tip: Designing your code to pass allpairspy output directly to parametrize means adding or changing a parameter only requires updating the parameter list — the rest of the code stays unchanged. Managing the parameter list in a config file or fixture takes maintainability even further.

Where Pairwise Testing Is Most Effective

Here’s a summary of the production scenarios where pairwise testing delivers the most value.

Scenario	Example Parameters	Reduction
Cross-browser / OS testing	OS × Browser × Screen size	27 → 9 cases (67% reduction)
Settings screen testing	Theme × Font size × Language × Notifications	81 → 12 cases (85% reduction)
API parameter testing	Sort order × Filter × Page size × Format	Significant reduction
Playwright device testing	Device × Network speed × Locale	27 → 9 cases (67% reduction)

⚠️ 4 Common Pitfalls in Pairwise Testing

Here are the mistakes that are easy to make when applying pairwise testing in practice.

① Applying pairwise to parameters with strong dependencies

If Safari only runs on macOS, pairwise may generate “Windows × Safari” — a combination that can’t actually occur. When parameters have strong dependencies between them, a decision table that explicitly models those relationships is the safer choice.

② Using pairwise when 3+ parameter interactions are critical

Pairwise covers every 2-parameter pair, but bugs that only surface when 3 or more parameters all have specific values simultaneously may be missed. For safety-critical systems or cases where higher-order interactions matter, consider 3-wise or 4-wise coverage instead.

③ Trying to replace all tests with pairwise testing alone

Pairwise testing optimizes combination testing — it doesn’t replace unit tests, equivalence partitioning, or boundary value analysis. Use pairwise for combination coverage, equivalence partitioning and boundary value analysis for input range verification. Each technique has its place.

④ Trusting the tool output without reviewing the generated cases

Don’t pipe allpairspy output straight into your test suite without checking it. Always verify that impossible combinations (e.g. macOS × Internet Explorer) aren’t included, and that no important pairs have been dropped. A quick manual review before automating is always worth it.

FAQ

Q. When should I use pairwise testing?

Use it when you have 5 or more parameters and testing every combination is impractical. It’s especially effective for environment configuration tests with independent parameters like OS, browser, device, language, and screen size. For 2–4 conditions with dependencies between them, decision tables are usually a better fit.

Q. How do I choose between pairwise testing and decision tables?

Use decision tables when you have 2–5 conditions, business logic, and dependencies between conditions. Use pairwise testing when you have 5+ parameters, environment configuration, and independent parameters. In practice both are often combined — decision tables for business logic, pairwise for cross-environment testing.

Q. Does pairwise testing guarantee 100% bug detection?

No. Pairwise testing finds more bugs more efficiently than full combination testing — it doesn’t provide a 100% guarantee. Bugs that only appear when 3 or more parameters all have specific values simultaneously may be missed. For high-risk systems, consider combining with higher-factor coverage (3-wise, 4-wise) or full combination testing for the most critical parameter sets.

Q. Is pairwise testing covered in ISTQB?

Yes. ISTQB Foundation Level includes pairwise testing (2-way coverage) in its scope. Understanding “the difference from full combination testing,” “what 2-way coverage means,” and “when to apply it” is sufficient for the exam. The Advanced Level goes into more depth.

📖 Related Articles

In production, a common workflow is to generate cross-browser test cases with allpairspy and pass them directly to Playwright × pytest’s parametrize. Because adding or changing parameters only requires updating the parameter list, the maintenance cost of both the test design and the implementation is kept to a minimum.

📋 Summary

Pairwise testing is a test design technique that guarantees every pair of parameter values is tested at least once
3 parameters × 3 values = 27 cases reduced to ~9 cases — while maintaining complete pair coverage
Python’s allpairspy auto-generates the optimal test set; pass it to parametrize for instant automated tests
Most effective when parameters are independent and there are 5 or more of them (use decision tables when there are dependencies)
For cases where 3+ parameter interactions matter, consider higher-factor coverage alongside pairwise

Pairwise testing is the practical answer to “we want to test everything but we don’t have the time.” Start by installing allpairspy and applying it to a cross-environment test or settings screen you already work with. Watching 27 cases collapse to 9 will change the way you think about test design.