哈希冲突的解决方法

哈希冲突是指两个不同的键通过哈希函数计算得到相同的哈希值时发生的情况。解决哈希冲突是哈希表设计中的核心问题，常见的解决方法有以下几种：

1. 开放地址法（Open Addressing）

开放地址法的基本思想是：当发生冲突时，在哈希表中寻找下一个可用的空槽位来存储冲突的元素。

1.1 线性探测（Linear Probing）

线性探测是最简单的开放地址法，当发生冲突时，顺序查找下一个空槽位。

function linearProbeInsert(hashTable, key, value) {
  let index = hash(key) % hashTable.length;
  
  // 当槽位被占用时，线性查找下一个空位
  while (hashTable[index] !== null) {
    index = (index + 1) % hashTable.length; // 循环到表头
  }
  
  hashTable[index] = { key, value };
  return index;
}

优点：实现简单，缓存友好。缺点：容易产生聚集（clustering）现象，降低查找效率。

1.2 二次探测（Quadratic Probing）

二次探测使用二次函数来确定下一个探测位置，减少聚集现象。

function quadraticProbeInsert(hashTable, key, value) {
  let index = hash(key) % hashTable.length;
  let i = 0;
  
  // 当槽位被占用时，使用二次函数查找下一个空位
  while (hashTable[index] !== null) {
    i++;
    index = (hash(key) + i * i) % hashTable.length;
  }
  
  hashTable[index] = { key, value };
  return index;
}

优点：减少聚集现象。缺点：不能保证找到所有空槽位，可能需要表大小为特定质数。

1.3 双重哈希（Double Hashing）

双重哈希使用两个哈希函数，当发生冲突时，使用第二个哈希函数计算步长。

function doubleHashInsert(hashTable, key, value) {
  let index = hash1(key) % hashTable.length;
  const step = hash2(key) % hashTable.length;
  
  // 确保步长不为0
  if (step === 0) {
    return -1; // 错误
  }
  
  // 当槽位被占用时，使用第二个哈希函数计算步长
  while (hashTable[index] !== null) {
    index = (index + step) % hashTable.length;
  }
  
  hashTable[index] = { key, value };
  return index;
}

优点：减少聚集，分布更均匀。缺点：计算成本较高，需要两个哈希函数。

2. 链地址法（Separate Chaining）

链地址法在哈希表的每个槽位维护一个链表（或其他数据结构），所有映射到同一位置的元素都存储在这个链表中。

class HashTableWithChaining {
  constructor(size) {
    this.size = size;
    this.table = new Array(size).fill(null).map(() => []);
  }
  
  insert(key, value) {
    const index = hash(key) % this.size;
    const bucket = this.table[index];
    
    // 检查键是否已存在
    for (let i = 0; i < bucket.length; i++) {
      if (bucket[i].key === key) {
        bucket[i].value = value; // 更新值
        return;
      }
    }
    
    // 添加新元素
    bucket.push({ key, value });
  }
  
  get(key) {
    const index = hash(key) % this.size;
    const bucket = this.table[index];
    
    for (let i = 0; i < bucket.length; i++) {
      if (bucket[i].key === key) {
        return bucket[i].value;
      }
    }
    
    return undefined; // 未找到
  }
}

优点：实现简单，不会因为冲突而丢失数据，适合数据量不确定的情况。缺点：需要额外的指针存储空间，链表过长时性能下降。

3. 再哈希法（Rehashing）

当哈希表的负载因子（元素数量/表大小）超过某个阈值时，创建一个更大的新表，并将所有元素重新哈希到新表中。

class HashTableWithRehashing {
  constructor(initialSize = 8) {
    this.size = initialSize;
    this.count = 0;
    this.threshold = 0.75; // 负载因子阈值
    this.table = new Array(initialSize).fill(null);
  }
  
  insert(key, value) {
    // 检查是否需要扩容
    if (this.count / this.size >= this.threshold) {
      this._resizeAndRehash();
    }
    
    let index = hash(key) % this.size;
    
    // 线性探测
    while (this.table[index] !== null && this.table[index].key !== key) {
      index = (index + 1) % this.size;
    }
    
    // 新元素或更新现有元素
    if (this.table[index] === null) {
      this.count++;
    }
    
    this.table[index] = { key, value };
  }
  
  _resizeAndRehash() {
    const oldTable = this.table;
    this.size *= 2; // 双倍扩容
    this.table = new Array(this.size).fill(null);
    this.count = 0;
    
    // 重新哈希所有元素
    for (let i = 0; i < oldTable.length; i++) {
      if (oldTable[i] !== null) {
        this.insert(oldTable[i].key, oldTable[i].value);
      }
    }
  }
}

优点：保持哈希表的性能，避免过度冲突。缺点：再哈希操作成本高，需要暂时停止操作。

4. 建立公共溢出区（Overflow Area）

将哈希表分为基本表和溢出区，所有冲突的元素都放入溢出区。

class HashTableWithOverflow {
  constructor(size) {
    this.size = size;
    this.table = new Array(size).fill(null);
    this.overflowArea = []; // 公共溢出区
  }
  
  insert(key, value) {
    const index = hash(key) % this.size;
    
    // 槽位为空或键已存在
    if (this.table[index] === null || this.table[index].key === key) {
      this.table[index] = { key, value };
      return;
    }
    
    // 冲突，放入溢出区
    this.overflowArea.push({ key, value });
  }
  
  get(key) {
    const index = hash(key) % this.size;
    
    // 检查基本表
    if (this.table[index] !== null && this.table[index].key === key) {
      return this.table[index].value;
    }
    
    // 检查溢出区
    for (let item of this.overflowArea) {
      if (item.key === key) {
        return item.value;
      }
    }
    
    return undefined; // 未找到
  }
}

优点：实现简单，基本表结构清晰。缺点：溢出区查找效率低，不适合大量冲突的情况。

哈希冲突解决方法比较

方法	时间复杂度(平均)	时间复杂度(最坏)	空间复杂度	适用场景
线性探测	O(1)	O(n)	O(n)	负载因子小，删除操作少
二次探测	O(1)	O(n)	O(n)	负载因子小，表大小为质数
双重哈希	O(1)	O(n)	O(n)	需要均匀分布，计算资源充足
链地址法	O(1 + α)	O(n)	O(n + m)	数据量不确定，冲突较多
再哈希法	O(1)	O(n)	O(n)	需要保持性能，可接受偶尔的停顿
公共溢出区	O(1)	O(m)	O(n + m)	冲突较少，溢出区小

注：α是负载因子，m是溢出区大小。

哈希冲突解决流程图

--- title: 哈希冲突解决流程 --- flowchart TD A[插入键值对] --> B[计算哈希值] B --> C{槽位是否为空?} C -->|是| D[直接存储] C -->|否| E{选择冲突解决策略} E -->|开放地址法| F[寻找下一个空槽位] E -->|链地址法| G[添加到链表] E -->|再哈希法| H[扩容并重新哈希] E -->|公共溢出区| I[放入溢出区] F --> D G --> J[完成插入] H --> D I --> J D --> J

哈希冲突解决方法选择

选择哪种冲突解决方法取决于具体的应用场景和需求：

内存受限：链地址法需要额外的指针存储空间，开放地址法更节省内存。
删除操作频繁：开放地址法中的删除操作较复杂，链地址法更适合。
数据量不确定：链地址法更灵活，可以动态处理不确定数量的元素。
性能要求高：双重哈希或再哈希法可以提供更均匀的分布和更好的性能。
实现简单：线性探测或链地址法实现相对简单。

总的来说，链地址法是最常用的冲突解决方法，因为它实现简单、适应性强，并且在实践中表现良好。Java中的HashMap、C++中的unordered_map等标准库实现都采用了链地址法（结合再哈希）来解决冲突。

参考资料

account_tree

思维导图

Interview AiBoxInterview AiBox — 面试搭档

不只是准备，更是实时陪练

Interview AiBox 在面试过程中提供实时屏幕提示、AI 模拟面试和智能复盘，让你每一次回答都更有信心。

免费下载 Interview AiBoxdownload 查看价格方案sell

AI 助读

一键发送到常用 AI

哈希冲突是指不同键通过哈希函数得到相同值的情况。主要解决方法包括：1）开放地址法（线性探测、二次探测、双重哈希），在表中寻找下一个空位；2）链地址法，每个槽位维护一个链表存储冲突元素；3）再哈希法，当负载因子过高时扩容并重新哈希；4）公共溢出区，将冲突元素放入专门区域。链地址法是最常用的方法，因其实现简单、适应性强，Java HashMap、C++ unordered_map等标准库都采用此方法。

智能总结

深度解读

考点定位

思路启发