Linear Probing: Analysis

Analyze the efficiency of "open address" hash tables.

If there are no collisions, insert/remove and search performance is $\Omicron(1)$ ; this is the best-case scenario.

The worst-case scenario is when the probing sequence goes over every occupied position. This scenario leads to $\Omicron(n)$ performance where $n$ is the number of items stored in the hash table.

Given an open-address hash table with load factor $\alpha$ , the expected number of probes in an unsuccessful search (or for inserting an element) is at most:

$\frac{1}{1 - \alpha}$

The statement above can be proved by assuming the hash table employs a uniform hashing function. The proof is beyond the scope of this class. Interested reader is referred to the CLRS: Theorem $11.6$ and Corollary $11.7$ .

The take-home message: since $\alpha < 1$ in open-address hash tables, the average (expected) performance is constant time under the assumption of usinform hashing.

Resources

Wikipedia entry on Hash Table: Performance.

Data Structures

Linear Probing: Analysis