Pigeonhole Algorithm In C Programming

Last Updated :29 Dec, 2025

Problem Statement

Many programming challenges involve working with data sets where elements fall within a limited range, but the total number of elements exceeds that range. For instance, you might need to find duplicates in an array, count the frequency of items, or prove the existence of a specific pattern given certain constraints. Without an understanding of principles like the Pigeonhole Principle, these problems might lead to complex, inefficient solutions involving extensive sorting or searching. The challenge lies in identifying when and how this principle can simplify the logic and improve performance.

Example

Consider an array of 5 integers, where each integer is guaranteed to be between 1 and 4, inclusive. The Pigeonhole Principle tells us that since we have 5 numbers (pigeons) and only 4 possible unique values (pigeonholes), at least one number must be repeated.

Here's a simple C program that demonstrates finding such a duplicate:

// Pigeonhole Duplicate Example
#include <stdio.h>
#include <stdbool.h> // For boolean type

int main() {
    // Step 1: Define the array of numbers
    int numbers[] = {3, 1, 4, 1, 2};
    int n = sizeof(numbers) / sizeof(numbers[0]); // Number of elements

    // Step 2: Define the range of possible values (pigeonholes)
    // Numbers are between 1 and 4. We need an array of size 5 to use indices 1-4.
    bool seen[5] = {false}; // Initialize all to false. Index 0 is unused.

    printf("Numbers in array: ");
    for (int i = 0; i < n; i++) {
        printf("%d ", numbers[i]);
    }
    printf("\\n");

    // Step 3: Iterate through the numbers and mark them as seen
    printf("Checking for duplicates...\\n");
    for (int i = 0; i < n; i++) {
        int current_number = numbers[i];
        if (seen[current_number]) { // If this number has been seen before
            printf("Duplicate found: %d\\n", current_number);
            return 0; // Exit after finding the first duplicate
        }
        seen[current_number] = true; // Mark this number as seen
    }

    printf("No duplicates found (this case should not happen given the problem setup).\\n");
    return 0;
}

Run

Output:

Numbers in array: 3 1 4 1 2 
Checking for duplicates...
Duplicate found: 1

Background & Knowledge Prerequisites

To effectively understand and apply the Pigeonhole Principle in C, readers should be familiar with:

C Language Basics: Variables, data types, control structures (loops, conditionals), arrays.
Arrays: Understanding how to declare, initialize, and access elements in arrays, which often serve as "pigeonholes" for counting or marking presence.
Boolean Logic: Basic understanding of true/false values and their use in conditions.
Mathematical Reasoning: An intuitive grasp of basic counting and logical deduction.

Use Cases or Case Studies

The Pigeonhole Principle, while simple, is a powerful tool in various programming scenarios:

Duplicate Detection in Limited Ranges: Finding repeating elements in an array where elements are known to fall within a small, defined range (e.g., numbers from 1 to 100 in an array of 101 elements).
Frequency Counting: Efficiently determining the occurrence of each item in a collection by using an array or hash map as "pigeonholes" where indices correspond to items.
Collision Detection: In hashing, the principle explains why collisions are inevitable if the number of items to hash exceeds the number of available hash slots.
Existence Proofs: Proving that certain conditions *must* be met (e.g., in a group of any 13 people, at least two must have been born in the same month).
Optimizing Space and Time: Designing algorithms that use constant extra space or achieve linear time complexity by leveraging the known bounds of input data.

Solution Approaches

While the Pigeonhole Principle isn't a "step-by-step algorithm" in itself, it guides the design of efficient algorithms. Here, we demonstrate C implementations that embody its spirit for common problems.

Approach 1: Detecting Duplicates in a Bounded Array

This approach directly applies the Pigeonhole Principle: if you have n items and m distinct possible values where n > m, then at least one value must be repeated. We use a boolean array as our "pigeonholes" to track seen values.

Summary: Check for duplicates in an array whose elements are within a known, small positive range by marking seen values.

// Detect Duplicates Using Pigeonholes
#include <stdio.h>
#include <stdbool.h>
#include <limits.h> // For INT_MAX/INT_MIN if needed, though not strictly for this example

// Function to find a duplicate in an array with elements in range [min_val, max_val]
int find_duplicate(int arr[], int n, int min_val, int max_val) {
    if (n <= 1) { // No duplicates possible in empty or single-element array
        return -1; // Indicate no duplicate
    }

    // Calculate the number of possible unique values (pigeonholes)
    int range_size = max_val - min_val + 1;

    // The pigeonhole principle applies if n > range_size
    if (n <= range_size) {
        // If number of items <= number of pigeonholes, duplicates are not guaranteed
        // A more general duplicate check (e.g., sorting, hashing) would be needed
        // For this specific problem, we assume n > range_size
        printf("Note: Pigeonhole Principle guarantees a duplicate only if N > (Max - Min + 1).\\n");
        printf("Current N: %d, Range Size: %d\\n", n, range_size);
        // Fall through to check anyway, as a duplicate might still exist.
    }

    // Create a boolean array to mark if a value has been seen.
    // Size is `range_size + 1` to use 1-based indexing for values or handle 0 correctly.
    // For values from `min_val` to `max_val`, map them to indices `0` to `range_size - 1`.
    bool *seen = (bool *)calloc(range_size, sizeof(bool)); // Initialize all to false

    if (seen == NULL) {
        printf("Memory allocation failed!\\n");
        return -1;
    }

    for (int i = 0; i < n; i++) {
        int val = arr[i];
        if (val < min_val || val > max_val) {
            printf("Error: Value %d out of expected range [%d, %d]\\n", val, min_val, max_val);
            free(seen);
            return -1;
        }

        // Map the value to an index in the 'seen' array
        int index = val - min_val;

        if (seen[index]) {
            free(seen);
            return val; // Duplicate found
        }
        seen[index] = true; // Mark as seen
    }

    free(seen);
    return -1; // No duplicate found (shouldn't happen if n > range_size)
}

int main() {
    int numbers1[] = {10, 2, 8, 5, 2, 7, 3, 10, 1}; // 9 numbers, range 1-10
    int n1 = sizeof(numbers1) / sizeof(numbers1[0]);
    int min1 = 1, max1 = 10; // Range of values

    printf("Array 1: ");
    for(int i=0; i<n1; ++i) printf("%d ", numbers1[i]);
    printf("\\n");
    int duplicate1 = find_duplicate(numbers1, n1, min1, max1);
    if (duplicate1 != -1) {
        printf("Duplicate found in Array 1: %d\\n", duplicate1);
    } else {
        printf("No duplicate found in Array 1.\\n");
    }
    printf("\\n");

    int numbers2[] = {5, 3, 8, 1, 4, 9, 2, 7, 6}; // 9 numbers, range 1-9
    int n2 = sizeof(numbers2) / sizeof(numbers2[0]);
    int min2 = 1, max2 = 9; // Range of values
    
    printf("Array 2: ");
    for(int i=0; i<n2; ++i) printf("%d ", numbers2[i]);
    printf("\\n");
    int duplicate2 = find_duplicate(numbers2, n2, min2, max2);
    if (duplicate2 != -1) {
        printf("Duplicate found in Array 2: %d\\n", duplicate2);
    } else {
        printf("No duplicate found in Array 2.\\n");
    }
    printf("\\n");

    int numbers3[] = {1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 1}; // 11 numbers, range 1-10
    int n3 = sizeof(numbers3) / sizeof(numbers3[0]);
    int min3 = 1, max3 = 10; // Range of values

    printf("Array 3: ");
    for(int i=0; i<n3; ++i) printf("%d ", numbers3[i]);
    printf("\\n");
    int duplicate3 = find_duplicate(numbers3, n3, min3, max3);
    if (duplicate3 != -1) {
        printf("Duplicate found in Array 3: %d\\n", duplicate3);
    } else {
        printf("No duplicate found in Array 3.\\n");
    }
    printf("\\n");

    return 0;
}

Run

Sample Output:

Array 1: 10 2 8 5 2 7 3 10 1 
Note: Pigeonhole Principle guarantees a duplicate only if N > (Max - Min + 1).
Current N: 9, Range Size: 10
Duplicate found in Array 1: 2

Array 2: 5 3 8 1 4 9 2 7 6 
No duplicate found in Array 2.

Array 3: 1 2 3 4 5 6 7 8 9 10 1 
Duplicate found in Array 3: 1

Stepwise Explanation:

Define Range and 'Pigeonholes': We determine the min_val and max_val to define the range of possible numbers. An auxiliary boolean array seen is created with a size equal to max_val - min_val + 1. Each index in seen corresponds to a unique number in our range, acting as a "pigeonhole."
Initialize Pigeonholes: All elements of the seen array are initialized to false, indicating that no number has been encountered yet.
Iterate and Mark: For each number in the input arr:

It's mapped to an index in the seen array by val - min_val.
If seen[index] is already true, it means we've encountered this number before, so a duplicate is found. The function returns this duplicate.
If seen[index] is false, we mark it true, indicating that this number has now been seen.

No Duplicate (Rare Case): If the loop completes without finding a duplicate, it returns -1. According to the Pigeonhole Principle, this should only happen if the number of elements n is not greater than the range size.

Approach 2: Character Frequency Counting

This approach uses the Pigeonhole Principle to count the frequency of characters in a string. Each unique character is a "pigeonhole," and the count of each character is the number of "pigeons" in that hole.

Summary: Count the occurrences of each character in a string using an array as frequency "pigeonholes."

// Character Frequency Counter
#include <stdio.h>
#include <string.h> // For strlen()

// Define a constant for the number of possible ASCII characters
#define ASCII_SIZE 256

int main() {
    char str[] = "hello world";
    int i = 0;

    // Create an array to store the frequency of each character (our pigeonholes)
    int freq[ASCII_SIZE] = {0}; // Initialize all counts to 0

    printf("Original string: \\"%s\\"\\n", str);

    // Iterate through the string and increment the count for each character
    while (str[i] != '\\0') {
        // Use the character's ASCII value as an index (pigeonhole)
        freq[(unsigned char)str[i]]++;
        i++;
    }

    printf("Character frequencies:\\n");
    // Print the frequency of characters that appeared
    for (i = 0; i < ASCII_SIZE; i++) {
        if (freq[i] > 0) {
            printf("  '%c' : %d\\n", (char)i, freq[i]);
        }
    }

    return 0;
}

Run

Sample Output:

Original string: "hello world"
Character frequencies:
  ' ' : 1
  'd' : 1
  'e' : 1
  'h' : 1
  'l' : 3
  'o' : 2
  'r' : 1
  'w' : 1

Stepwise Explanation:

Define Pigeonholes: An integer array freq of size ASCII_SIZE (256 for standard ASCII characters) is declared. Each index i in freq represents a unique ASCII character (our pigeonhole), and freq[i] stores its count.
Initialize Counts: All elements of freq are initialized to 0.
Populate Pigeonholes: The program iterates through the input string. For each character str[i]:

Its ASCII value is used as an index into the freq array.
freq[str[i]] is incremented. This means a "pigeon" (the character) is placed into its corresponding "pigeonhole" (the frequency counter for that character).

Display Results: After processing the entire string, the freq array contains the total count for each character. The program then iterates through freq and prints the counts for characters that appeared at least once (freq[i] > 0).

Conclusion

The Pigeonhole Principle, while simple in its statement, provides an elegant framework for solving certain classes of problems in C programming. It allows us to infer the existence of duplicates, patterns, or specific conditions without exhaustive searching, often leading to more efficient algorithms. By mapping problem elements to "pigeons" and their possible categories to "pigeonholes," we can leverage this principle for tasks like duplicate detection and frequency counting, resulting in clear and performant code.

Summary

The Pigeonhole Principle states that if you have more items than containers, at least one container must hold more than one item.
In C programming, this principle is implicitly applied in problems involving:
Duplicate detection in arrays with bounded element ranges.
Frequency counting of characters or numbers.
Existence proofs for certain conditions.
It often leads to efficient solutions (e.g., O(N) time complexity) by using auxiliary arrays as "pigeonholes" to mark presence or count occurrences.
Understanding the principle helps in designing algorithms that are both correct and performant, especially when dealing with constraints on input data.

Hope, This article was helpful?

Sorry about that

How can we improve it?

Thank you for helping us improve this article.

Pigeonhole Algorithm In C Programming

Problem Statement

Example

Background & Knowledge Prerequisites

Use Cases or Case Studies

Solution Approaches

Approach 1: Detecting Duplicates in a Bounded Array

Approach 2: Character Frequency Counting

Conclusion

Summary

Sorry about that

Recent Articles

Related Articles

C Program Of Simple Bubble Sort Implementation Using Array Ascending Order...

C Program For Insertion Sort In Descending Order...

C Program For Selection Sort Algorithm In Data Structure...

C Program To Implement Oddeven Sorting...

C Program For Selection Sort In Descending Order...

C Program For Shell Sort In Data Structure...

Related Articles

C Program Of Simple Bubble Sort Implementation Using Array Ascending Order...

C Program For Insertion Sort In Descending Order...

C Program For Selection Sort Algorithm In Data Structure...

C Program To Implement Oddeven Sorting...

C Program For Selection Sort In Descending Order...

C Program For Shell Sort In Data Structure...