C++ unordered 系列容器认识与模拟实现 | 极客日志

C++算法

C++ unordered 系列容器认识与模拟实现

介绍 C++ STL 中基于哈希表实现的 unordered_map 和 unordered_set 容器。阐述了其无序存储、O(1) 平均时间复杂度的特性，对比了与普通 map/set 的差异。重点讲解了底层哈希桶结构、冲突解决及自定义哈希函数方法。最后通过代码模拟实现了 unordered 系列的核心逻辑，包括迭代器单向遍历、扩容机制及插入删除操作。

蜜桃汽水发布于 2026/3/30更新于 2026/7/2045 浏览

1. 了解 unordered 系列

在 C++ STL（标准模板库）中，unordered_map 和 unordered_set 是两种基于哈希表（Hash Table）实现的关联式容器，核心特点是无序存储、平均时间复杂度 O(1) 的增删查操作，但需注意哈希冲突对性能的影响。二者设计目标不同（键值对存储 vs 单一值存储），但底层原理和核心特性高度相似。

1.1 初步认识

unordered_set：

unordered_set 的声明如下，Key 就是 unordered_set 底层关键字的类型：

template<class Key, // unordered_set::key_type/value_type
class Hash = hash<Key>, // unordered_set::hasher
class Pred = equal_to<Key>, // unordered_set::key_equal
class Alloc = allocator<Key> // unordered_set::allocator_type>
class unordered_set;

unordered_set 默认要求 Key 支持转换为整形，如果不支持或者想按自己的需求走可以自行实现支持将 Key 转成整形的仿函数传给第二个模板参数；
unordered_set 默认要求 Key 支持比较相等，如果不支持或者想按自己的需求走可以自行实现支持将 Key 比较相等的仿函数传给第三个模板参数；
unordered_set 底层存储数据的内存是从空间配置器申请的，如果需要可以自己实现内存池，传给第四个参数。

一般情况下，我们都不需要传后三个模板参数。unordered_set 迭代器遍历不再有序，为了跟 set 区分，所以取名 unordered_set。前面部分我们已经学习了 set 容器的使用，set 和 unordered_set 的功能高度相似，只是底层结构不同，有一些性能和使用上的差异，下面会讲它们的差异部分。

unordered_map：

unordered_map 的声明如下：

template<class Key, // unordered_map::key_type
class T, // unordered_map::mapped_type
class Hash = hash<Key>, // unordered_map::hasher
class Pred = equal_to<Key>, // unordered_map::key_equal
class Alloc = allocator<pair< K, V>>> 
 unordered_map;

相关免费在线工具

加密/解密文本
使用加密算法（如AES、TripleDES、Rabbit或RC4）加密和解密文本明文。在线工具，加密/解密文本在线工具，online
Gemini 图片去水印
基于开源反向 Alpha 混合算法去除 Gemini/Nano Banana 图片水印，支持批量处理与下载。在线工具，Gemini 图片去水印在线工具，online
Base64 字符串编码/解码
将字符串编码和解码为其 Base64 格式表示形式即可。在线工具，Base64 字符串编码/解码在线工具，online
Base64 文件转换器
将字符串、文件或图像转换为其 Base64 表示形式。在线工具，Base64 文件转换器在线工具，online
Markdown转HTML
将 Markdown（GFM）转为 HTML 片段，浏览器内 marked 解析；与 HTML转Markdown 互为补充。在线工具，Markdown转HTML在线工具，online
HTML转Markdown
将 HTML 片段转为 GitHub Flavored Markdown，支持标题、列表、链接、代码块与表格等；浏览器内处理，可链接预填。在线工具，HTML转Markdown在线工具，online

特性	unordered_map	unordered_set
存储元素	键值对（`std::pair<const K, V>`）	单一值（`const K`，值即键）
核心功能	通过键快速查找对应的值（键值映射）	快速判断一个值是否存在（去重、判重）
关键接口差异	支持 `operator[]`（通过键访问 / 修改值）	无 `operator[]`（仅需判断值是否存在）
迭代器解引用结果	指向键值对（`pair<const K, V>`），可通过 `->first` 访问键、`->second` 访问值	指向单一值（`const K`），解引用直接得到键
典型使用场景	字典（如单词 - 翻译映射）、缓存（如 ID - 用户信息映射）	去重（如统计数组中不重复元素）、快速判重（如判断用户是否在黑名单）

#include <iostream>
#include <unordered_map>
#include <string>
using namespace std;

int main()
{
    // 1. 定义 unordered_map（键：string，值：int）
    unordered_map<string, int> score_map;
    
    // 2. 插入键值对（三种方式）
    score_map.insert({"Alice", 95}); // 列表初始化
    score_map.insert(pair<string, int>("Bob", 88)); // pair 构造
    score_map["Charlie"] = 92; // operator[] 插入（不存在则新建，存在则修改值）
    
    // 3. 查找键（find 返回迭代器，未找到则返回 end()）
    auto it = score_map.find("Bob");
    if (it != score_map.end())
    {
        cout << "Bob's score: " << it->second << endl; // 访问值：it->second
    }
    
    // 4. 修改值（通过 operator[] 或迭代器）
    score_map["Bob"] = 90; // operator[] 直接修改
    it->second = 90; // 迭代器修改（需确保迭代器有效）
    
    // 5. 遍历（无序）
    for (const auto& kv : score_map)
    {
        cout << kv.first << ": " << kv.second << endl; // kv.first 是键，kv.second 是值
    }
    
    // 6. 删除键
    score_map.erase("Charlie");
    return 0;
}

#include <iostream>
#include <unordered_set>
#include <vector>
using namespace std;

int main()
{
    // 1. 定义 unordered_set（存储 int 类型，值即键）
    unordered_set<int> num_set;
    
    // 2. 插入值（去重特性：重复值插入失败）
    vector<int> nums = {3, 1, 4, 1, 5, 9, 2, 6, 5};
    for (int num : nums)
    {
        auto ret = num_set.insert(num); // 返回 pair<iterator, bool>
        if (!ret.second)
        {
            cout << "值 " << num << " 已存在，插入失败" << endl;
        }
    }
    
    // 3. 查找值（判断是否存在）
    if (num_set.count(5)) // count 返回 0（不存在）或 1（存在）
    {
        cout << "值 5 存在于集合中" << endl;
    }
    
    // 4. 遍历（无序，去重后结果）
    for (int num : num_set)
    {
        cout << num << " "; // 直接访问值（即键）
    }
    cout << endl;
    
    // 5. 删除值
    num_set.erase(9);
    return 0;
}

容器系列	底层数据结构	核心设计目标
unordered 系列	链式哈希表（哈希桶）	追求「快速增删查」，以空间换时间
普通有序系列	红黑树（自平衡二叉搜索树）	追求「有序存储」，兼顾效率与排序

需求优先级	推荐容器	原因
追求增删查的效率（O(1)）	unordered_map/unordered_set	哈希表平均复杂度更低，适合数据量大、对效率敏感的场景
需要有序存储（如按键排序遍历）	map/set	红黑树天然有序，支持 `lower_bound`/`upper_bound` 等范围查询接口
存储自定义类型且不愿写哈希函数	map/set	仅需重载 `<` 运算符（红黑树排序用），无需自定义哈希函数
内存占用敏感	map/set	哈希表需预留桶数组空间（通常有负载因子，如 0.7），内存利用率略低

template<class K, class T, class Ref, class Ptr, class KeyOfT, class Hash>
struct HTIterator
{
    typedef HashNode<T> Node;
    typedef HTIterator<K, T, Ref, Ptr, KeyOfT, Hash> Self;
    Node* _node;
    const HashTable<K, T, KeyOfT, Hash>* _pht;
    
    // ... 构造函数省略
    
    Self& operator++()
    {
        if (_node->_next)
        {
            _node = _node->_next;
        }
        else
        {
            KeyOfT kot;
            Hash hs;
            size_t hashi = hs(kot(_node->_data)) % _pht->_tables.size() + 1;
            while (hashi < _pht->_tables.size())
            {
                if (_pht->_tables[hashi])
                {
                    _node = _pht->_tables[hashi];
                    break;
                }
                hashi++;
            }
            if (hashi == _pht->_tables.size())
            {
                _node = nullptr;
            }
        }
        return *this;
    }
};

#pragma once
#include "hash_bucket.h"
namespace hcy {
template<class K, class Hash = HashFunc<K>>
class unordered_set
{
    struct SetKeyOfT
    {
        const K& operator()(const K& key) { return key; }
    };
public:
    typedef typename hash_bucket::HashTable<K, const K, SetKeyOfT, Hash>::Iterator iterator;
    typedef typename hash_bucket::HashTable<K, const K, SetKeyOfT, Hash>::ConstIterator const_iterator;
    
    iterator begin() { return _ht.Begin(); }
    iterator end() { return _ht.End(); }
    const_iterator begin() const { return _ht.Begin(); }
    const_iterator end() const { return _ht.End(); }
    
    pair<iterator, bool> insert(const K& key) { return _ht.Insert(key); }
    iterator find(const K& key) { return _ht.Find(); }
    bool erase(const K& key) { return _ht.Erase(); }
private:
    hash_bucket::HashTable<K, const K, SetKeyOfT, Hash> _ht;
};
}

#pragma once
#include "hash_bucket.h"
namespace hcy {
template<class K, class V, class Hash = HashFunc<K>>
class unordered_map
{
    struct MapKeyOfT
    {
        const K& operator()(const pair<K, V>& kv) { return kv.first; }
    };
public:
    typedef typename hash_bucket::HashTable<K, pair<const K, V>, MapKeyOfT, Hash>::Iterator iterator;
    typedef typename hash_bucket::HashTable<K, pair<const K, V>, MapKeyOfT, Hash>::ConstIterator const_iterator;
    
    iterator begin() { return _ht.Begin(); }
    iterator end() { return _ht.End(); }
    const_iterator begin() const { return _ht.Begin(); }
    const_iterator end() const { return _ht.End(); }
    
    pair<iterator, bool> insert(const pair<K, V>& kv) { return _ht.Insert(kv); }
    iterator find(const K& key) { return _ht.Find(); }
    bool erase(const K& key) { return _ht.Erase(); }
    V& operator[](const K& key)
    {
        pair<iterator, bool> ret = _ht.Insert({ key, V() });
        return ret.first->second;
    }
private:
    hash_bucket::HashTable<K, pair<const K, V>, MapKeyOfT, Hash> _ht;
};
}

#pragma once
#include <iostream>
#include <string>
#include <vector>
using namespace std;

template<class K>
struct HashFunc
{
    size_t operator()(const K& key) { return (size_t)key; }
};

// 特化
template<>
struct HashFunc<string>
{
    size_t operator()(const string& s)
    {
        size_t ret = 0;
        // 这里我们使用 BKDR 哈希的思路，用上次的计算结果去乘以一个质数，这个质数一般取 31, 131 等效果会比较好
        for (auto e : s)
        {
            ret *= 31;
            ret += e;
        }
        return ret;
    }
};

namespace hash_bucket {
template<class T>
struct HashNode
{
    HashNode<T>* _next;
    T _data;
    HashNode(const T& data) :_data(data), _next(nullptr) {}
};

template<class K, class T, class KeyOfT, class Hash>
class HashTable;

template<class K, class T, class Ref, class Ptr, class KeyOfT, class Hash>
struct HTIterator
{
    typedef HashNode<T> Node;
    typedef HTIterator<K, T, Ref, Ptr, KeyOfT, Hash> Self;
    Node* _node;
    const HashTable<K, T, KeyOfT, Hash>* _pht;
    
    HTIterator(Node* node, const HashTable<K, T, KeyOfT, Hash>* pht) :_node(node), _pht(pht) {}
    
    Ref operator*() { return _node->_data; }
    Ptr operator->() { return &_node->_data; }
    
    // 单向迭代器
    Self& operator++()
    {
        if (_node->_next)
        {
            _node = _node->_next;
        }
        else
        {
            KeyOfT kot;
            Hash hs;
            size_t hashi = hs(kot(_node->_data)) % _pht->_tables.size() + 1;
            while (hashi < _pht->_tables.size())
            {
                if (_pht->_tables[hashi])
                {
                    _node = _pht->_tables[hashi];
                    break;
                }
                hashi++;
            }
            if (hashi == _pht->_tables.size())
            {
                _node = nullptr;
            }
        }
        return *this;
    }
    
    bool operator!=(const Self& self) { return self._node != _node; }
    bool operator==(const Self& self) { return self._node == _node; }
};

template<class K, class T, class KeyOfT, class Hash>
class HashTable
{
    // 友元声明
    template<class K, class T, class Ref, class Ptr, class KeyOfT, class Hash>
    friend struct HTIterator;
    
    typedef HashNode<T> Node;
    
    inline unsigned long __stl_next_prime(unsigned long n)
    {
        static const int __stl_num_primes = 28;
        static const unsigned long __stl_prime_list[__stl_num_primes] =
        {
            53, 97, 193, 389, 769, 1543, 3079, 6151, 12289, 24593, 49157, 98317,
            196613, 393241, 786433, 1572869, 3145739, 6291469, 12582917, 25165843,
            50331653, 100663319, 201326611, 402653189, 805306457, 1610612741,
            3221225473, 4294967291
        };
        const unsigned long* first = __stl_prime_list;
        const unsigned long* last = __stl_prime_list + __stl_num_primes;
        const unsigned long* pos = lower_bound(first, last, n);
        return pos == last ? *(last - 1) : *pos;
    }
public:
    typedef HTIterator<K, T, T&, T*, KeyOfT, Hash> Iterator;
    typedef HTIterator<K, T, const T&, const T*, KeyOfT, Hash> ConstIterator;
    
    HashTable() { _tables.resize(__stl_next_prime(0), nullptr); }
    
    ~HashTable()
    {
        for (size_t i = 0; i < _tables.size(); i++)
        {
            Node* cur = _tables[i];
            while (cur)
            {
                Node* tmp = cur;
                cur = cur->_next;
                delete tmp;
            }
            _tables[i] = nullptr;
        }
    }
    
    Iterator End() { return Iterator(nullptr, this); }
    
    Iterator Begin()
    {
        if (_n == 0) { return End(); }
        for (int i = 0; i < _tables.size(); i++)
        {
            if (_tables[i]) { return Iterator(_tables[i], this); }
        }
        return End();
    }
    
    ConstIterator End() const { return Iterator(nullptr, this); }
    
    ConstIterator Begin() const
    {
        if (_n == 0) { return End(); }
        for (int i = 0; i < _tables.size(); i++)
        {
            if (_tables[i]) { return Iterator(_tables[i], this); }
        }
        return End();
    }
    
    pair<Iterator, bool> Insert(const T& data)
    {
        KeyOfT kot;
        Hash hs;
        Iterator it = Find(kot(data));
        if (it != End()) { return { it, false }; }
        
        // 扩容
        if ((double)_n / _tables.size() >= 0.75)
        {
            vector<Node*> newtables(__stl_next_prime(_tables.size() + 1), nullptr);
            for (int i = 0; i < _tables.size(); i++)
            {
                Node* cur = _tables[i];
                while (cur)
                {
                    Node* next = cur->_next;
                    size_t hashi = hs(kot(cur->_data)) % newtables.size();
                    cur->_next = newtables[hashi];
                    newtables[hashi] = cur;
                    cur = next;
                }
            }
            _tables.swap(newtables);
        }
        
        size_t hash_i = hs(kot(data)) % _tables.size();
        Node* newnode = new Node(data);
        newnode->_next = _tables[hash_i];
        _tables[hash_i] = newnode;
        ++_n;
        return { Iterator(newnode, this), true };
    }
    
    Iterator Find(const K& key)
    {
        Hash hs;
        KeyOfT kot;
        size_t hashi = hs(key) % _tables.size();
        Node* cur = _tables[hashi];
        while (cur)
        {
            if (kot(cur->_data) == key) { return Iterator(cur, this); }
            cur = cur->_next;
        }
        return End();
    }
    
    bool Erase(const K& key)
    {
        Hash hs;
        KeyOfT kot;
        size_t hashi = hs(key) % _tables.size();
        Node* cur = _tables[hashi];
        Node* prev = nullptr;
        while (cur)
        {
            if (kot(cur->_data) == key)
            {
                if (prev == nullptr) { _tables[hashi] = cur->_next; }
                else { prev->_next = cur->_next; }
                delete cur;
                --_n;
                return true;
            }
            prev = cur;
            cur = cur->_next;
        }
        return false;
    }
private:
    // 指针数组
    vector<Node*> _tables;
    // 表中存储数据个数
    size_t _n = 0;
};
}

#include "unordered_set.h"
#include "unordered_map.h"

void test_map()
{
    hcy::unordered_map<string, string> dict;
    dict.insert({"sort", "排序"});
    dict.insert({"left", "左边"});
    dict.insert({"right", "右边"});
    dict["left"] = "左边";
    dict["insert"] = "插入";
    dict["string"];
    
    hcy::unordered_map<string, string>::iterator it = dict.begin();
    while (it != dict.end())
    {
        // 不能修改 first，可以修改 second
        // it->first += 'x';
        it->second += 'x';
        cout << it->first << ":" << it->second << endl;
        ++it;
    }
    cout << endl;
}

void test_set()
{
    hcy::unordered_set<int> s;
    int a[] = {4, 2, 6, 1, 3, 5, 15, 7, 16, 14, 3, 3, 15};
    for (auto e : a)
    {
        s.insert(e);
    }
    for (auto e : s)
    {
        cout << e << " ";
    }
    cout << endl;
    
    hcy::unordered_set<int>::iterator it = s.begin();
    while (it != s.end())
    {
        // 不支持修改
        //*it += 1;
        cout << *it << " ";
        ++it;
    }
    cout << endl;
}

int main()
{
    test_set();
    cout << endl;
    test_map();
    return 0;
}

C++ unordered 系列容器认识与模拟实现

1. 了解 unordered 系列

1.1 初步认识

更多推荐文章

相关免费在线工具

1.2 核心差异

1.3 使用示例

1.4 与普通 map/set 的区别

2. 模拟实现 unordered 系列

2.1 迭代器

更多推荐文章

相关免费在线工具

C++ unordered 系列容器认识与模拟实现

1. 了解 unordered 系列

1.1 初步认识

微信扫一扫，关注极客日志

更多推荐文章

相关免费在线工具

1.2 核心差异

1.3 使用示例

1.4 与普通 map/set 的区别

2. 模拟实现 unordered 系列

2.1 迭代器

微信扫一扫，关注极客日志

更多推荐文章

相关免费在线工具