C++ unordered 系列容器使用与底层模拟实现 | 极客日志

C++算法

C++ unordered 系列容器使用与底层模拟实现

C++ unordered 系列容器基于哈希表实现，提供 O(1) 平均时间复杂度的增删查操作。本文对比了其与红黑树实现（set/map）在遍历顺序、Key 约束及性能上的差异，并深入剖析了底层哈希桶结构。通过模拟实现 myunordered_map 和 myunordered_set，展示了如何复用通用哈希表框架，处理 Key 提取仿函数、迭代器遍历逻辑以及扩容机制，帮助理解 STL 容器的核心设计思想。

橘子海发布于 2026/3/28更新于 2026/7/1932 浏览

C++ unordered 系列容器使用与底层模拟实现

一、unordered_map 和 unordered_set 的使用

1. unordered_set 系列的使用

1.1 unordered_set 类的介绍

unordered_set 的声明如下，Key 就是底层关键字的类型。默认要求 Key 支持转换为整形，如果不支持或者想按自己的需求走可以自行实现支持将 Key 转成整形的仿函数传给第二个模板参数。同时默认要求 Key 支持比较相等，如果不支持或者想按自己的需求走可以自行实现支持将 Key 比较相等的仿函数传给第三个模板参数。

unordered_set 底层是用哈希桶实现，增删查平均效率是 O(1)，迭代器遍历无序。因为 unordered 系列容器底层是通过哈希桶实现，要支持哈希函数以及处理哈希冲突的方式，所以 Key 要支持转换成整形并且要求支持比较相等。针对不能转换成整形的类型，支持通过传入仿函数进行转换。

unordered_set 底层存储数据的内存是从空间配置器申请的，如果需要可以自己实现内存池，传给第四个参数，一般情况下我们都不需要传后三个模板参数。

其实一开始 STL 库先提供了 map/set 容器（红黑树封装实现），不过后来鉴于哈希桶实现的 map 和 set 确实有自身优势，STL 就又提供相关哈希桶实现容器。不过因为 map、set 的命名已经确定了，如果以 hashmap/hashset 新容器命名，无法很好凸显不同容器的特点。因为红黑树实现的 map、set 迭代器遍历有序，哈希表实现的遍历不再有序，所以 STL 中取名为 unordered_set/unordered_map。

前面部分我们已经学习了 set 容器的使用，set 和 unordered_set 的功能高度相似，只是底层结构不同，有一些性能和使用的差异，这里我们只讲他们的差异部分。

template < class Key,
             class Hash = hash<Key>,
             class Pred = equal_to<Key>,
             class Alloc = allocator<Key> >
class unordered_set;

1.2 unordered_set 和 set 的使用差异

查看文档我们会发现 unordered_set 的支持增删查且跟 set 的使用一模一样，关于使用我们这里就不再赘述和演示了。

unordered_set 和 set 的第一个差异是对 key 的要求不同，set 要求 Key 支持小于比较，而 unordered_set 要求 Key 支持转成整形且支持等于比较。要理解 unordered_set 的这个两点要求要结合哈希表底层实现才能真正理解，也就是说这本质是哈希表的要求。

unordered_set 和 set 的第二个差异是迭代器的差异，set 的 iterator 是双向迭代器，unordered_set 是单向迭代器。其次 set 底层是红黑树，红黑树是二叉搜索树，走中序遍历是有序的，所以 set 迭代器遍历是有序 + 去重。而 unordered_set 底层是哈希表，迭代器遍历是无序 + 去重。

unordered_set 和 set 的第三个差异是性能的差异，整体而言大多数场景下，unordered_set 的增删查改更快一些，因为红黑树增删查改效率是 O(logN)，而哈希表增删查平均效率是 O(1)。具体可以参看下面代码的演示的对比差异。

#include<unordered_set>
#include<set>
#include<iostream>
using  std;

{
      N = ;
    unordered_set<> us;
    set<> s;
    vector<> v;
    v.(N);
    (());
     ( i = ; i < N; ++i) {
        v.(() + i); 
    }

     begin1 = ();
     ( e : v) { s.(e); }
     end1 = ();
    cout <<  << end1 - begin1 << endl;

     begin2 = ();
    us.(N);
     ( e : v) { us.(e); }
     end2 = ();
    cout <<  << end2 - begin2 << endl;

     m1 = ;
     begin3 = ();
     ( e : v) {
         ret = s.(e);
         (ret != s.()) { ++m1; }
    }
     end3 = ();
    cout <<  << end3 - begin3 <<  << m1 << endl;

     m2 = ;
     begin4 = ();
     ( e : v) {
         ret = us.(e);
         (ret != us.()) { ++m2; }
    }
     end4 = ();
    cout <<  << end4 - begin4 <<  << m2 << endl;

    cout <<  << s.() << endl;
    cout <<  << us.() << endl << endl;

     begin5 = ();
     ( e : v) { s.(e); }
     end5 = ();
    cout <<  << end5 - begin5 << endl;

     begin6 = ();
     ( e : v) { us.(e); }
     end6 = ();
    cout <<  << end6 - begin6 << endl << endl;

     ;
}

{
    ();
     ;
}

相关免费在线工具

加密/解密文本
使用加密算法（如AES、TripleDES、Rabbit或RC4）加密和解密文本明文。在线工具，加密/解密文本在线工具，online
Gemini 图片去水印
基于开源反向 Alpha 混合算法去除 Gemini/Nano Banana 图片水印，支持批量处理与下载。在线工具，Gemini 图片去水印在线工具，online
Base64 字符串编码/解码
将字符串编码和解码为其 Base64 格式表示形式即可。在线工具，Base64 字符串编码/解码在线工具，online
Base64 文件转换器
将字符串、文件或图像转换为其 Base64 表示形式。在线工具，Base64 文件转换器在线工具，online
Markdown转HTML
将 Markdown（GFM）转为 HTML 片段，浏览器内 marked 解析；与 HTML转Markdown 互为补充。在线工具，Markdown转HTML在线工具，online
HTML转Markdown
将 HTML 片段转为 GitHub Flavored Markdown，支持标题、列表、链接、代码块与表格等；浏览器内处理，可链接预填。在线工具，HTML转Markdown在线工具，online

Buckets	接口说明
bucket_count	返回容器中的桶数量
max_bucket_count	返回容器可以拥有的最大桶数
bucket_size	返回桶 n 中的元素数量
bucket	返回元素值 k 所在的桶号

Hash policy	接口说明
load_factor	返回容器中的当前负载因子
max_load_factor	获取或设置最大负载因子
rehash	将容器中的桶数量设置为 n 或更多，强制执行重新散列。当容器的负载因子即将超过其最大负载因子时，容器会自动执行 rehash。此函数需要桶的数量作为参数。
reserve	将容器中的桶数量（bucket_count）设置为最适合至少包含 n 个元素的数量。如果 n 大于当前 bucket_count 乘以 max_load_factor，则容器中的 bucket_count 会增加，并强制进行重新哈希。

// stl_hash_set template <class Value, class HashFcn = hash<Value>, class EqualKey = equal_to<Value>, class Alloc = alloc>
class hash_set {
private:
    typedef hashtable<Value, Value, HashFcn, identity<Value>, EqualKey, Alloc> ht;
    ht rep;
public:
    // ... type definitions ...
};

// stl_hash_map template <class Key, class T, class HashFcn = hash<Key>, class EqualKey = equal_to<Key>, class Alloc = alloc>
class hash_map {
private:
    typedef hashtable<pair<const Key, T>, Key, HashFcn, select1st<pair<const Key, T> >, EqualKey, Alloc> ht;
    ht rep;
public:
    // ... type definitions ...
};

// stl_hashtable.h template <class Value, class Key, class HashFcn, class ExtractKey, class EqualKey, class Alloc>
class hashtable {
public:
    // ... interface ...
private:
    hasher hash;
    key_equal equals;
    ExtractKey get_key;
    typedef __hashtable_node<Value> node;
    vector<node*, Alloc> buckets;
    size_type num_elements;
};

// MyUnorderedSet.h
namespace zlr {
template<class K, class Hash = HashFunc<K>>
class unordered_set {
    struct SetKeyOfT {
        const K& operator()(const K& key) { return key; }
    };
public:
    bool insert(const K& key) { return _ht.Insert(key); }
private:
    hash_bucket::HashTable<K, K, SetKeyOfT, Hash> _ht;
};
}

// MyUnorderedMap.h
namespace zlr {
template<class K, class V, class Hash = HashFunc<K>>
class unordered_map {
    struct MapKeyOfT {
        const K& operator()(const pair<K, V>& kv) { return kv.first; }
    };
public:
    bool insert(const pair<K, V>& kv) { return _ht.Insert(kv); }
private:
    hash_bucket::HashTable<K, pair<K, V>, MapKeyOfT, Hash> _ht;
};
}

// HashTable.h
template<class K> struct HashFunc {
    size_t operator()(const K& key) { return (size_t)key; }
};

namespace hash_bucket {
template<class T> struct HashNode {
    T _data;
    HashNode<T>* _next;
    HashNode(const T& data) :_data(data), _next(nullptr) {}
};

template<class K, class T, class KeyOfT, class Hash>
class HashTable {
    typedef HashNode<T> Node;
    inline unsigned long __stl_next_prime(unsigned long n) {
        static const int __stl_num_primes = 28;
        static const unsigned long __stl_prime_list[__stl_num_primes] = {
            53, 97, 193, 389, 769, 1543, 3079, 6151, 12289, 24593, 49157, 98317,
            196613, 393241, 786433, 1572869, 3145739, 6291469, 12582917, 25165843,
            50331653, 100663319, 201326611, 402653189, 805306457, 1610612741,
            3221225473, 4294967291
        };
        const unsigned long* first = __stl_prime_list;
        const unsigned long* last = __stl_prime_list + __stl_num_primes;
        const unsigned long* pos = lower_bound(first, last, n);
        return pos == last ? *(last - 1) : *pos;
    }
public:
    HashTable() { _tables.resize(__stl_next_prime(_tables.size()), nullptr); }
    ~HashTable() {
        for (size_t i = 0; i < _tables.size(); i++) {
            Node* cur = _tables[i];
            while (cur) { Node* next = cur->_next; delete cur; cur = next; }
            _tables[i] = nullptr;
        }
    }

    bool Insert(const T& data) {
        KeyOfT kot;
        if (Find(kot(data))) return false;
        Hash hs;
        size_t hashi = hs(kot(data)) % _tables.size();
        // 负载因子==1 扩容
        if (_n == _tables.size()) {
            vector<Node*> newtables(__stl_next_prime(_tables.size()), nullptr);
            for (size_t i = 0; i < _tables.size(); i++) {
                Node* cur = _tables[i];
                while (cur) {
                    Node* next = cur->_next;
                    size_t hashi = hs(kot(cur->_data)) % newtables.size();
                    cur->_next = newtables[hashi];
                    newtables[hashi] = cur;
                    cur = next;
                }
                _tables[i] = nullptr;
            }
            _tables.swap(newtables);
        }
        Node* newnode = new Node(data);
        newnode->_next = _tables[hashi];
        _tables[hashi] = newnode;
        ++_n;
        return true;
    }
private:
    vector<Node*> _tables;
    size_t _n = 0;
};
}

template <class Value, class Key, class HashFcn, class ExtractKey, class EqualKey, class Alloc>
struct __hashtable_iterator {
    typedef hashtable<Value, Key, HashFcn, ExtractKey, EqualKey, Alloc> hashtable;
    typedef __hashtable_iterator<Value, Key, HashFcn, ExtractKey, EqualKey, Alloc> iterator;
    typedef __hashtable_const_iterator<Value, Key, HashFcn, ExtractKey, EqualKey, Alloc> const_iterator;
    typedef __hashtable_node<Value> node;
    typedef forward_iterator_tag iterator_category;
    typedef Value value_type;
    node* cur;
    hashtable* ht;
    __hashtable_iterator(node* n, hashtable* tab) : cur(n), ht(tab) {}
    __hashtable_iterator() {}
    reference operator*() const { return cur->val; }
    pointer operator->() const { return &(operator*()); }
    iterator& operator++();
    iterator operator++(int);
    bool operator==(const iterator& it) const { return cur == it.cur; }
    bool operator!=(const iterator& it) const { return cur != it.cur; }
};

template <class V, class K, class HF, class ExK, class EqK, class A>
__hashtable_iterator<V, K, HF, ExK, EqK, A>& __hashtable_iterator<V, K, HF, ExK, EqK, A>::operator++() {
    const node* old = cur;
    cur = cur->next;
    if (!cur) {
        size_type bucket = ht->bkt_num(old->val);
        while (!cur && ++bucket < ht->buckets.size()) cur = ht->buckets[bucket];
    }
    return *this;
}

// HashTable.h
template<class K> struct HashFunc {
    size_t operator()(const K& key) { return (size_t)key; }
};
// 特化
template<> struct HashFunc<string> {
    size_t operator()(const string& key) {
        size_t hash = 0;
        for (auto e : key) { hash *= 131; hash += e; }
        return hash;
    }
};

namespace hash_bucket {
template<class T> struct HashNode {
    T _data;
    HashNode<T>* _next;
    HashNode(const T& data) :_data(data), _next(nullptr) {}
};

// 前置声明
template<class K, class T, class KeyOfT, class Hash> class HashTable;

template<class K, class T, class Ptr, class Ref, class KeyOfT, class Hash>
struct HTIterator {
    typedef HashNode<T> Node;
    typedef HTIterator<K, T, Ptr, Ref, KeyOfT, Hash> Self;
    Node* _node;
    const HashTable<K, T, KeyOfT, Hash>* _pht;
    HTIterator(Node* node, const HashTable<K, T, KeyOfT, Hash>* pht) :_node(node), _pht(pht) {}
    Ref operator*() { return _node->_data; }
    Ptr operator->() { return &_node->_data; }
    bool operator!=(const Self& s) { return _node != s._node; }
    Self& operator++() {
        if (_node->_next) {
            _node = _node->_next;
        } else {
            KeyOfT kot;
            Hash hs;
            size_t hashi = hs(kot(_node->_data)) % _pht->_tables.size();
            ++hashi;
            while (hashi < _pht->_tables.size()) {
                if (_pht->_tables[hashi]) { break; }
                ++hashi;
            }
            if (hashi == _pht->_tables.size()) {
                _node = nullptr;
            } else {
                _node = _pht->_tables[hashi];
            }
        }
        return *this;
    }
};

template<class K, class T, class KeyOfT, class Hash>
class HashTable {
    template<class K, class T, class Ptr, class Ref, class KeyOfT, class Hash>
    friend struct HTIterator;
    typedef HashNode<T> Node;
public:
    typedef HTIterator<K, T, T*, T&, KeyOfT, Hash> Iterator;
    typedef HTIterator<K, T, const T*, const T&, KeyOfT, Hash> ConstIterator;
    Iterator Begin() {
        if (_n == 0) return End();
        for (size_t i = 0; i < _tables.size(); i++) {
            Node* cur = _tables[i];
            if (cur) { return Iterator(cur, this); }
        }
        return End();
    }
    Iterator End() { return Iterator(nullptr, this); }
    ConstIterator Begin() const {
        if (_n == 0) return End();
        for (size_t i = 0; i < _tables.size(); i++) {
            Node* cur = _tables[i];
            if (cur) { return ConstIterator(cur, this); }
        }
        return End();
    }
    ConstIterator End() const { return ConstIterator(nullptr, this); }
    // ... other methods like Insert, Find, Erase ...
    pair<Iterator, bool> Insert(const T& data) {
        KeyOfT kot;
        Iterator it = Find(kot(data));
        if (it != End()) return make_pair(it, false);
        Hash hs;
        size_t hashi = hs(kot(data)) % _tables.size();
        if (_n == _tables.size()) {
            vector<Node*> newtables(__stl_next_prime(_tables.size() + 1), nullptr);
            for (size_t i = 0; i < _tables.size(); i++) {
                Node* cur = _tables[i];
                while (cur) {
                    Node* next = cur->_next;
                    size_t hashi = hs(kot(cur->_data)) % newtables.size();
                    cur->_next = newtables[hashi];
                    newtables[hashi] = cur;
                    cur = next;
                }
                _tables[i] = nullptr;
            }
            _tables.swap(newtables);
        }
        Node* newnode = new Node(data);
        newnode->_next = _tables[hashi];
        _tables[hashi] = newnode;
        ++_n;
        return make_pair(Iterator(newnode, this), true);
    }
    Iterator Find(const K& key) {
        KeyOfT kot;
        Hash hs;
        size_t hashi = hs(key) % _tables.size();
        Node* cur = _tables[hashi];
        while (cur) {
            if (kot(cur->_data) == key) { return Iterator(cur, this); }
            cur = cur->_next;
        }
        return End();
    }
    bool Erase(const K& key) {
        KeyOfT kot;
        Hash hs;
        size_t hashi = hs(key) % _tables.size();
        Node* prev = nullptr;
        Node* cur = _tables[hashi];
        while (cur) {
            if (kot(cur->_data) == key) {
                if (prev == nullptr) { _tables[hashi] = cur->_next; }
                else { prev->_next = cur->_next; }
                delete cur; --_n; return true;
            }
            prev = cur; cur = cur->_next;
        }
        return false;
    }
private:
    vector<Node*> _tables;
    size_t _n = 0;
};
}

// MyUnorderedSet.h
namespace zlr {
template<class K, class Hash = HashFunc<K>>
class unordered_set {
    struct SetKeyOfT {
        const K& operator()(const K& key) { return key; }
    };
public:
    typedef typename hash_bucket::HashTable<K, const K, SetKeyOfT, Hash>::Iterator iterator;
    typedef typename hash_bucket::HashTable<K, const K, SetKeyOfT, Hash>::ConstIterator const_iterator;
    iterator begin() { return _ht.Begin(); }
    iterator end() { return _ht.End(); }
    const_iterator begin() const { return _ht.Begin(); }
    const_iterator end() const { return _ht.End(); }
    pair<iterator, bool> insert(const K& key) { return _ht.Insert(key); }
    iterator Find(const K& key) { return _ht.Find(key); }
    bool Erase(const K& key) { return _ht.Erase(key); }
private:
    hash_bucket::HashTable<K, const K, SetKeyOfT, Hash> _ht;
};
void test_set() {
    unordered_set<int> s;
    int a[] = { 4, 2, 6, 1, 3, 5, 15, 7, 16, 14, 3,3,15 };
    for (auto e : a) { s.insert(e); }
    for (auto e : s) { cout << e << " "; }
    cout << endl;
    unordered_set<int>::iterator it = s.begin();
    while (it != s.end()) {
        cout << *it << " "; ++it;
    }
    cout << endl;
}
}

// MyUnorderedMap.h
namespace zlr {
template<class K, class V, class Hash = HashFunc<K>>
class unordered_map {
    struct MapKeyOfT {
        const K& operator()(const pair<K, V>& kv) { return kv.first; }
    };
public:
    typedef typename hash_bucket::HashTable<K, pair<const K, V>, MapKeyOfT, Hash>::Iterator iterator;
    typedef typename hash_bucket::HashTable<K, pair<const K, V>, MapKeyOfT, Hash>::ConstIterator const_iterator;
    iterator begin() { return _ht.Begin(); }
    iterator end() { return _ht.End(); }
    const_iterator begin() const { return _ht.Begin(); }
    const_iterator end() const { return _ht.End(); }
    pair<iterator, bool> insert(const pair<K, V>& kv) { return _ht.Insert(kv); }
    V& operator[](const K& key) {
        pair<iterator, bool> ret = _ht.Insert(make_pair(key, V()));
        return ret.first->second;
    }
    iterator Find(const K& key) { return _ht.Find(key); }
    bool Erase(const K& key) { return _ht.Erase(key); }
private:
    hash_bucket::HashTable<K, pair<const K, V>, MapKeyOfT, Hash> _ht;
};
void test_map() {
    unordered_map<string, string> dict;
    dict.insert({ "sort", "排序" });
    dict.insert({ "left", "左边" });
    dict.insert({ "right", "右边" });
    dict["left"] = "左边，剩余";
    dict["insert"] = "插入";
    dict["string"];
    unordered_map<string, string>::iterator it = dict.begin();
    while (it != dict.end()) {
        it->second += 'x';
        cout << it->first << ":" << it->second << endl;
        ++it;
    }
    cout << endl;
}
}

C++ unordered 系列容器使用与底层模拟实现

C++ unordered 系列容器使用与底层模拟实现

一、unordered_map 和 unordered_set 的使用

1. unordered_set 系列的使用

1.1 unordered_set 类的介绍

1.2 unordered_set 和 set 的使用差异

更多推荐文章

相关免费在线工具

1.3 unordered_map 和 map 的使用差异

1.4 unordered_multimap/unordered_multiset

1.5 unordered_xxx 的哈希相关接口

二、用哈希表封装 myunordered_map 和 myunordered_set

1. 源码及框架分析

2. 模拟实现 unordered_map 和 unordered_set

2.1 实现出复用哈希表的框架，并支持 insert

2.2 支持 iterator 的实现

iterator 核心源代码

iterator 实现思路分析

2.3 map 支持 []

2.4 bit::unordered_map 和 bit::unordered_set 代码实现

更多推荐文章

相关免费在线工具

C++ unordered 系列容器使用与底层模拟实现

C++ unordered 系列容器使用与底层模拟实现

一、unordered_map 和 unordered_set 的使用

1. unordered_set 系列的使用

1.1 unordered_set 类的介绍

1.2 unordered_set 和 set 的使用差异

微信扫一扫，关注极客日志

更多推荐文章

相关免费在线工具

1.3 unordered_map 和 map 的使用差异

1.4 unordered_multimap/unordered_multiset

1.5 unordered_xxx 的哈希相关接口

二、用哈希表封装 myunordered_map 和 myunordered_set

1. 源码及框架分析

2. 模拟实现 unordered_map 和 unordered_set

2.1 实现出复用哈希表的框架，并支持 insert

2.2 支持 iterator 的实现

iterator 核心源代码

iterator 实现思路分析

2.3 map 支持 []

2.4 bit::unordered_map 和 bit::unordered_set 代码实现

微信扫一扫，关注极客日志

更多推荐文章

相关免费在线工具