C++ 哈希表原理与实战：从概念到实现 | 极客日志

C++算法

C++ 哈希表原理与实战：从概念到实现

C++ 哈希表通过哈希函数将关键字映射到存储位置，实现高效查找。涵盖哈希概念、冲突处理（开放定址法、链地址法）、哈希函数设计及扩容策略，并提供基于 C++ 的完整代码实现，帮助开发者掌握哈希在实际开发中的应用。

Qiny01发布于 2026/2/4更新于 2026/6/22.7K 浏览

前言：

在 C++ 编程中，高效的数据查找和存储是核心需求之一。哈希（Hash）技术凭借其近似 O(1) 的查找效率，成为解决这一需求的重要手段。本文将从哈希的基本概念出发，逐步深入 C++ 中的哈希实现，包括 STL 哈希容器、哈希函数设计、哈希冲突解决等关键知识点，并结合实例帮助大家掌握哈希在实际开发中的应用。

一、哈希的概念

哈希 (hash) 又称散列，是一种组织数据的方式。从译名来看，有散乱排列的意思。本质就是通过哈希函数把关键字 Key 跟存储位置建立一个映射关系，查找时通过这个哈希函数计算出 Key 存储的位置，进行快速查找。

1.1 直接定址法

当关键字的范围比较集中时，直接定址法就是非常简单高效的方法，比如一组关键字都在 [0,99] 之间，那么我们开一个 100 个数的数组，每个关键字的值直接就是存储位置的下标。再比如一组关键字值都在 [a,z] 的小写字母，那么我们开一个 26 个数的数组，每个关键字 ascii 码 - a ascii 码就是存储位置的下标。也就是说直接定址法本质就是用关键字计算出一个绝对位置或者相对位置。这个方法我们在计数排序部分已经用过了，其次在 string 章节的下面 OJ 也用过了。

示例：

字符串中的第一个唯一字符 - LeetCode

C++ 代码实现（哈希）：

class Solution {
public:
    int firstUniqChar(string s) {
        // 每个字母的 ascii 码-'a'的 ascii 码作为下标映射到 count 数组，数组中存储出现的次数
        int count[26]={0}; 
        // 统计次数
        for(auto ch:s) { 
            count[ch-'a']++; 
        } 
        for(size_t i=0;i<s.size();++i) { 
            if(count[s[i]-'a']==1) return i; 
        } 
        return -1; 
    }
};

1.2 哈希冲突

直接定址法的缺点非常明显，当关键字的范围比较分散时，会很浪费内存甚至内存不够用。假设我们只有数据范围是 [0,9999] 的 N 个值，要映射到一个 M 个空间的数组中（一般情况下 M >= N），那么就要借助哈希函数 (hash function) hf，关键字 key 被放到数组的 h(key) 位置，这里要注意的是 h(key) 计算出的值必须在 [0, M) 之间。

这里存在的一个问题就是，两个不同的 key 可能会映射到同一个位置去，这种问题我们叫做哈希冲突，或者哈希碰撞。理想情况是找出一个好的哈希函数避免冲突，但是实际场景中，冲突是不可避免的，所以我们尽可能设计出优秀的哈希函数，减少冲突的次数，同时也要去设计出解决冲突的方案。

1.3 负载因子

假设哈希表中已经映射存储了 N 个值，哈希表的大小为 M，那么 负载因子 = N / M，负载因子有些地方也翻译为载荷因子 / 装载因子等，其英文为 load factor。。

相关免费在线工具

加密/解密文本
使用加密算法（如AES、TripleDES、Rabbit或RC4）加密和解密文本明文。在线工具，加密/解密文本在线工具，online
Gemini 图片去水印
基于开源反向 Alpha 混合算法去除 Gemini/Nano Banana 图片水印，支持批量处理与下载。在线工具，Gemini 图片去水印在线工具，online
Base64 字符串编码/解码
将字符串编码和解码为其 Base64 格式表示形式即可。在线工具，Base64 字符串编码/解码在线工具，online
Base64 文件转换器
将字符串、文件或图像转换为其 Base64 表示形式。在线工具，Base64 文件转换器在线工具，online
Markdown转HTML
将 Markdown（GFM）转为 HTML 片段，浏览器内 marked 解析；与 HTML转Markdown 互为补充。在线工具，Markdown转HTML在线工具，online
HTML转Markdown
将 HTML 片段转为 GitHub Flavored Markdown，支持标题、列表、链接、代码块与表格等；浏览器内处理，可链接预填。在线工具，HTML转Markdown在线工具，online

enum State { EXIST, EMPTY, DELETE }; 
template<class K, class V> struct HashData { 
    pair<K, V> _kv; 
    State _state = EMPTY; 
}; 
template<class K, class V> class HashTable { 
private: 
    vector<HashData<K, V>> _tables; 
    size_t _n = 0; // 表中存储数据个数 
};

static const int __stl_num_primes = 28; 
static const unsigned long __stl_prime_list[__stl_num_primes] = { 
    53, 97, 193, 389, 769, 1543, 3079, 6151, 12289, 24593, 49157, 98317, 
    196613, 393241, 786433, 1572869, 3145739, 6291469, 12582917, 25165843, 
    50331653, 100663319, 201326611, 402653189, 805306457, 1610612741, 
    3221225473, 4294967291 
}; 
inline unsigned long __stl_next_prime(unsigned long n) { 
    const unsigned long* first = __stl_prime_list; 
    const unsigned long* last = __stl_prime_list + __stl_num_primes; 
    // >= n 
    const unsigned long* pos = lower_bound(first, last, n); 
    return pos == last ? *(last - 1) : *pos; 
}

template<class K> struct HashFunc { 
    size_t operator()(const K& key) { 
        return (size_t)key; 
    } 
}; 
// 特化 
template<> struct HashFunc<string> { 
    // 字符串转换成整形，可以把字符 ascii 码相加即可 
    // 但是直接相加的话，类似"abcd"和"bcad"这样的字符串计算出是相同的 
    // 这里我们使用 BKDR 哈希的思路，用上次的计算结果去乘以一个质数， 
    // 这个质数一般去 31, 131 等效果会比较好 
    size_t operator()(const string& key) { 
        size_t hash = 0; 
        for (auto ch : key) { 
            hash += ch; 
            hash *= 131; 
        } 
        return hash; 
    } 
}; 
template<class K, class V, class Hash = HashFunc<K>> class HashTable { 
public: 
private: 
    vector<HashData<K, V>> _tables; 
    size_t _n = 0; // 表中存储数据个数 
};

template<class T> struct HashNode { 
    T _data; 
    HashNode<T>* _next; 
    HashNode(const T& data) :_data(data) , _next(nullptr) { } 
};

pair<Iterator, bool> Insert(const T& data) { 
    KeyOfT kot; 
    if (auto it = Find(kot(data)); it != End()) return { it, false }; 
    Hash hs; 
    // 负载因子 == 1 就开始扩容 
    if (_n == _tables.size()) { 
        std::vector<Node*> newtables(__stl_next_prime(_tables.size() + 1), nullptr); 
        for (size_t i = 0; i < _tables.size(); i++) { 
            // 遍历旧表，旧表节点重新映射，挪动到新表 
            Node* cur = _tables[i]; 
            while (cur) { 
                Node* next = cur->_next; 
                // 头插 
                size_t hashi = hs(kot(cur->_data)) % newtables.size(); 
                cur->_next = newtables[hashi]; 
                newtables[hashi] = cur; 
                cur = next; 
            } 
            _tables[i] = nullptr; 
        } 
        _tables.swap(newtables); 
    } 
    size_t hashi = hs(kot(data)) % _tables.size(); 
    // 头插 
    Node* newnode = new Node(data); 
    newnode->_next = _tables[hashi]; 
    _tables[hashi] = newnode; 
    ++_n; 
    return { Iterator(newnode, this), true }; 
}

Iterator Find(const K& key) { 
    KeyOfT kot; 
    Hash hs; 
    size_t hashi = hs(key) % _tables.size(); 
    Node* cur = _tables[hashi]; 
    while (cur) { 
        if (kot(cur->_data) == key) { 
            return { cur, this }; 
        } 
        cur = cur->_next; 
    } 
    return { nullptr, this }; 
}

bool Erase(const K& key) { 
    KeyOfT kot; 
    Hash hs; 
    size_t hashi = hs(key) % _tables.size(); 
    Node* prev = nullptr; 
    Node* cur = _tables[hashi]; 
    while (cur) { 
        if (kot(cur->_data) == key) { 
            // 删除 
            if (prev == nullptr) { 
                // 桶中第一个节点 
                _tables[hashi] = cur->_next; 
            } else { 
                prev->_next = cur->_next; 
            } 
            --_n; 
            delete cur; 
            return true; 
        } 
        prev = cur; 
        cur = cur->_next; 
    } 
    return false; 
}

#include<vector>
static const int __stl_num_primes = 28;
static const unsigned long __stl_prime_list[__stl_num_primes] = { 
    53, 97, 193, 389, 769, 1543, 3079, 6151, 12289, 24593, 49157, 98317, 
    196613, 393241, 786433, 1572869, 3145739, 6291469, 12582917, 25165843, 
    50331653, 100663319, 201326611, 402653189, 805306457, 1610612741, 
    3221225473, 4294967291 
};
inline unsigned long __stl_next_prime(unsigned long n) { 
    const unsigned long* first = __stl_prime_list;
    const unsigned long* last = __stl_prime_list + __stl_num_primes;
    // >= n 
    const unsigned long* pos = lower_bound(first, last, n);
    return pos == last ? *(last - 1) : *pos;
}
template<class K> struct HashFunc { 
    size_t operator()(const K& key) { 
        return (size_t)key; 
    } 
}; 
// 特化 
template<> struct HashFunc<string> { 
    // 字符串转换成整形，可以把字符 ascii 码相加即可 
    // 但是直接相加的话，类似"abcd"和"bcad"这样的字符串计算出是相同的 
    // 这里我们使用 BKDR 哈希的思路，用上次的计算结果去乘以一个质数， 
    // 这个质数一般去 31, 131 等效果会比较好 
    size_t operator()(const string& key) { 
        size_t hash = 0; 
        for (auto ch : key) { 
            hash += ch; 
            hash *= 131; 
        } 
        return hash; 
    } 
}; 
namespace hash_bucket { 
    template<class T> struct HashNode { 
        T _data; 
        HashNode<T>* _next; 
        HashNode(const T& data) :_data(data) , _next(nullptr) { } 
    }; 
    // 前置声明 
    template<class K, class T, class KeyOfT, class Hash> class HashTable; 
    template<class K, class T, class Ref, class Ptr, class KeyOfT, class Hash> struct HTIterator { 
        typedef HashNode<T> Node; 
        typedef HashTable<K, T, KeyOfT, Hash> HT; 
        typedef HTIterator<K, T, Ref, Ptr, KeyOfT, Hash> Self; 
        Node* _node; 
        const HT* _pht; 
        HTIterator(Node* node, const HT* pht) :_node(node) , _pht(pht) { } 
        Ref operator*() { return _node->_data; } 
        Ptr operator->() { return &_node->_data; } 
        Self& operator++() {//9:11 
            if (_node->_next) // 当前桶没走完 
            { 
                _node = _node->_next; 
            } else // 当前桶走完了，找到下一个桶的第一个节点 
            { 
                KeyOfT kot; 
                Hash hs; 
                // 算出当前桶的位置 
                size_t hashi = hs(kot(_node->_data)) % _pht->_tables.size(); 
                ++hashi; 
                while (hashi < _pht->_tables.size()) { 
                    if (_pht->_tables[hashi]) // 找到下一个不为空的桶 
                    { 
                        _node = _pht->_tables[hashi]; 
                        break; 
                    } else { 
                        ++hashi; 
                    } 
                } 
                if (hashi == _pht->_tables.size()) // 最后一个桶走完了，要++到 end() 位置 
                { 
                    // end() 中_node 是空 
                    _node = nullptr; 
                } 
            } 
            return *this; 
        } 
        bool operator!=(const Self& s) const { return _node != s._node; } 
        bool operator==(const Self& s) const { return _node == s._node; } 
    }; 
    // hash_bucket::HashTable<K, pair<K, V>, MapKeyOfT> _ht; 
    // hash_bucket::HashTable<K, K, SetKeyOfT> _ht; 
    template<class K, class T, class KeyOfT, class Hash> class HashTable { 
        // 友元声明 
        template<class K, class T, class Ref, class Ptr, class KeyOfT, class Hash> friend struct HTIterator; 
        typedef HashNode<T> Node; 
    public: 
        typedef HTIterator<K, T, T&, T*, KeyOfT, Hash> Iterator; 
        typedef HTIterator<K, T, const T&, const T*, KeyOfT, Hash> ConstIterator; 
        Iterator Begin() { 
            if (_n == 0) { return End(); } 
            for (size_t i = 0; i < _tables.size(); i++) { 
                if (_tables[i]) { return Iterator(_tables[i], this); } 
            } 
            return End(); 
        } 
        Iterator End() { return Iterator(nullptr, this); } 
        ConstIterator Begin() const { 
            if (_n == 0) { return End(); } 
            for (size_t i = 0; i < _tables.size(); i++) { 
                if (_tables[i]) { return ConstIterator(_tables[i], this); } 
            } 
            return End(); 
        } 
        ConstIterator End() const { return Iterator(nullptr, this); } 
        HashTable() :_tables(__stl_next_prime(1), nullptr) , _n(0) { } 
        ~HashTable() { 
            for (size_t i = 0; i < _tables.size(); i++) { 
                Node* cur = _tables[i]; 
                while (cur) { 
                    Node* next = cur->_next; 
                    delete cur; 
                    cur = next; 
                } 
                _tables[i] = nullptr; 
            } 
            _n = 0; 
        } 
        pair<Iterator, bool> Insert(const T& data) { 
            KeyOfT kot; 
            if (auto it = Find(kot(data)); it != End()) return { it, false }; 
            Hash hs; 
            // 负载因子 == 1 就开始扩容 
            if (_n == _tables.size()) { 
                std::vector<Node*> newtables(__stl_next_prime(_tables.size() + 1), nullptr); 
                for (size_t i = 0; i < _tables.size(); i++) { 
                    // 遍历旧表，旧表节点重新映射，挪动到新表 
                    Node* cur = _tables[i]; 
                    while (cur) { 
                        Node* next = cur->_next; 
                        // 头插 
                        size_t hashi = hs(kot(cur->_data)) % newtables.size(); 
                        cur->_next = newtables[hashi]; 
                        newtables[hashi] = cur; 
                        cur = next; 
                    } 
                    _tables[i] = nullptr; 
                } 
                _tables.swap(newtables); 
            } 
            size_t hashi = hs(kot(data)) % _tables.size(); 
            // 头插 
            Node* newnode = new Node(data); 
            newnode->_next = _tables[hashi]; 
            _tables[hashi] = newnode; 
            ++_n; 
            return { Iterator(newnode, this), true }; 
        } 
        Iterator Find(const K& key) { 
            KeyOfT kot; 
            Hash hs; 
            size_t hashi = hs(key) % _tables.size(); 
            Node* cur = _tables[hashi]; 
            while (cur) { 
                if (kot(cur->_data) == key) { 
                    return { cur, this }; 
                } 
                cur = cur->_next; 
            } 
            return { nullptr, this }; 
        } 
        bool Erase(const K& key) { 
            KeyOfT kot; 
            Hash hs; 
            size_t hashi = hs(key) % _tables.size(); 
            Node* prev = nullptr; 
            Node* cur = _tables[hashi]; 
            while (cur) { 
                if (kot(cur->_data) == key) { 
                    // 删除 
                    if (prev == nullptr) { 
                        // 桶中第一个节点 
                        _tables[hashi] = cur->_next; 
                    } else { 
                        prev->_next = cur->_next; 
                    } 
                    --_n; 
                    delete cur; 
                    return true; 
                } 
                prev = cur; 
                cur = cur->_next; 
            } 
            return false; 
        } 
    private: 
        std::vector<Node*> _tables; // 指针数组 
        size_t _n; 
        // std::vector<std::list<K, V>> _tables; 
    }; 
}

#include<iostream>
#include<unordered_map>
using namespace std;
#include"HashTable.h"
void Print(const hash_bucket::HashTable<int, int>& s) {
    hash_bucket::HashTable<int, int>::Iterator it = s.Begin();
    while (it != s.End()) {
        cout << *it << " ";
        ++it;
    }
    cout << endl;
}
int main() {
    hash_bucket::HashTable<int, int> us;
    us.Insert(3);
    us.Insert(1000);
    us.Insert(2);
    us.Insert(102);
    us.Insert(2111);
    us.Insert(22);
    hash_bucket::HashTable<int, int>::Iterator it = us.Begin();
    while (it != us.End()) {
        cout << *it << " ";
        ++it;
    }
    cout << endl;
    Print(us);
    hash_bucket::HashTable<string, string> dict;
    dict.Insert({ "string", "字符串" });
    dict.Insert({ "left", "左边" });
    dict.Insert({ "right", "右边" });
    dict.Insert({ "map", "图" });
    for (auto& [k, v] : dict) {
        cout << k << ":" << v << endl;
    }
    return 0;
}

C++ 哈希表原理与实战：从概念到实现

前言：

一、哈希的概念

1.1 直接定址法

1.2 哈希冲突

1.3 负载因子

更多推荐文章

相关免费在线工具

1.4 将关键字转为整数

二、哈希函数

2.1 除法散列法 / 除留余数法

2.2 乘法散列法（了解）

2.3 全域散列法（了解）

2.4 其他方法（了解）

三、处理哈希冲突（重点！）

3.1 开放定址法

3.1.1 线性探测（含群集/堆积问题）

3.1.2 二次探测

3.1.3 双重散列

四、全面解析开放定址法（结构到细节的实战）

4.1 哈希表结构的实现

4.2 扩容问题

4.3 key 不能取模的问题

五、链地址法

5.1 为什么用链地址法

5.2 扩容

5.3 极端场景

5.4 哈希桶的实现

5.4.1 哈希桶的结构

5.4.2 插入

5.4.3 查找

5.4.4 删除

六、完整代码：

HashTable.h：

Test.cpp：

更多推荐文章

相关免费在线工具

C++ 哈希表原理与实战：从概念到实现

前言：

一、哈希的概念

1.1 直接定址法

1.2 哈希冲突

1.3 负载因子

微信扫一扫，关注极客日志

更多推荐文章

相关免费在线工具

1.4 将关键字转为整数

二、哈希函数

2.1 除法散列法 / 除留余数法

2.2 乘法散列法（了解）

2.3 全域散列法（了解）

2.4 其他方法（了解）

三、处理哈希冲突（重点！）

3.1 开放定址法

3.1.1 线性探测（含群集/堆积问题）

3.1.2 二次探测

3.1.3 双重散列

四、全面解析开放定址法（结构到细节的实战）

4.1 哈希表结构的实现

4.2 扩容问题

4.3 key 不能取模的问题

五、链地址法

5.1 为什么用链地址法

5.2 扩容

5.3 极端场景

5.4 哈希桶的实现

5.4.1 哈希桶的结构

5.4.2 插入

5.4.3 查找

5.4.4 删除

六、完整代码：

HashTable.h：

Test.cpp：

微信扫一扫，关注极客日志

更多推荐文章

相关免费在线工具