【C++】map和set的介绍及使用

前言：

map和 set 是 C++ STL（标准模板库）中的两种非常重要的容器，它们基于一种叫做平衡二叉搜索树（通常是红黑树）的数据结构来实现。在 C++ 中，map 是一个键值对容器，set 只存储唯一的键，而这两个容器都通过二叉树的结构来保持数据的有序性和高效的查找、插入、删除操作。

1. 序列式容器和关联式容器

前⾯我们已经接触过STL中的部分容器如：string、vector、list、deque、array、forward_list等，这些容器统称为序列式容器，因为逻辑结构为线性序列的数据结构，两个位置存储的值之间⼀般没有紧密的关联关系，⽐如交换⼀下，他依旧是序列式容器。顺序容器中的元素是按他们在容器中的存储位置来顺序保存和访问的。

关联式容器也是⽤来存储数据的，与序列式容器不同的是，关联式容器逻辑结构通常是⾮线性结构，两个位置有紧密的关联关系，交换⼀下，他的存储结构就被破坏了。顺序容器中的元素是按关键字来保存和访问的。关联式容器有map/set系列和unordered_map/unordered_set系列。

本章节讲解的map和set底层是红⿊树，红⿊树是⼀颗平衡⼆叉搜索树。set是key搜索场景的结构， map是key/value搜索场景的结构。

2. set系列的使用

2.1 set和multiset参考⽂档

set - C++ Reference (cplusplus.com)

2.2 set类的介绍

set的声明如下，T就是set底层关键字的类型

set默认要求T⽀持⼩于⽐较，如果不⽀持或者想按⾃⼰的需求⾛可以⾃⾏实现仿函数传给第⼆个模版参数

set底层存储数据的内存是从空间配置器申请的，如果需要可以⾃⼰实现内存池，传给第三个参数。

⼀般情况下，我们都不需要传后两个模版参数。

set底层是⽤红⿊树实现，增删查效率是，迭代器遍历是⾛的搜索树的中序，所以是有序的。 O(logN)

前⾯部分我们已经学习了vector/list等容器的使⽤，STL容器接⼝设计，⾼度相似，所以这⾥我们就不再⼀个接⼝⼀个接⼝的介绍，⽽是直接带着⼤家看⽂档，挑⽐较重要的接⼝进⾏介绍。

2.3 set的构造和迭代器

set的构造我们关注以下⼏个接⼝即可。

set的⽀持正向和反向迭代遍历，遍历默认按升序顺序，因为底层是⼆叉搜索树，迭代器遍历⾛的中序；⽀持迭代器就意味着⽀持范围for，set的iterator和const_iterator都不⽀持迭代器修改数据，修改关键字数据，破坏了底层搜索树的结构。

// empty (1) ⽆参默认构造
explicit set(const key_compare& comp = key_compare(),
   const allocator_type& alloc = allocator_type());

// range (2) 迭代器区间构造
template <class InputIterator>
set(InputIterator first, InputIterator last,
   const key_compare& comp = key_compare(),
   const allocator_type & = allocator_type());

// copy (3) 拷⻉构造
set(const set& x);

// initializer list (5) initializer 列表构造
set(initializer_list<value_type> il,
   const key_compare& comp = key_compare(),
   const allocator_type& alloc = allocator_type());

// 迭代器是⼀个双向迭代器
iterator->a bidirectional iterator to const value_type

// 正向迭代器
iterator begin();
iterator end();

// 反向迭代器
reverse_iterator rbegin();
reverse_iterator rend();

2.4 set的增删查

set的增删查关注以下⼏个接⼝即可：

Member types
key_type->The first template parameter(T)
value_type->The first template parameter(T)

// 单个数据插⼊，如果已经存在则插⼊失败
pair<iterator, bool> insert(const value_type& val);

// 列表插⼊，已经在容器中存在的值不会插⼊
void insert(initializer_list<value_type> il);

// 迭代器区间插⼊，已经在容器中存在的值不会插⼊
template <class InputIterator>
void insert(InputIterator first, InputIterator last);

// 查找val，返回val所在的迭代器，没有找到返回end()
iterator find(const value_type& val);

// 查找val，返回Val的个数
size_type count(const value_type& val) const;

// 删除⼀个迭代器位置的值
iterator erase(const_iterator position);

// 删除val，val不存在返回0，存在返回1
size_type erase(const value_type& val);

// 删除⼀段迭代器区间的值
iterator erase(const_iterator first, const_iterator last);

// 返回⼤于等val位置的迭代器
iterator lower_bound(const value_type& val) const;

// 返回⼤于val位置的迭代器
iterator upper_bound(const value_type& val) const;

2.5 insert和迭代器遍历使用样例

int main()
{//去重+升序/*set<int> s;*/set<int, greater<int>> s;s.insert(5);s.insert(2);s.insert(7);s.insert(5);s.insert(7);s.insert(3);//set<int>::iterator it = s.begin();auto it = s.begin();while (it != s.end()){// error C3892: “it”: 不能给常量赋值cout << *it << " ";++it;}cout << endl;s.insert({ 2,8,3,9,2 });for (auto e : s){cout << e << " ";}cout << endl;// void insert (initializer_list<value_type> il);//set<string> strset = { "sort","insert","add" };set<string> strset({ "sort","insert","add" });// 遍历string比较ascll码大小顺序遍历的for (auto& e : strset){cout << e << " ";}cout << endl;return 0;
}

运行结果：

2.6 find和erase使用样例：

int main()
{set<int> s = { 4,2,7,2,8,5,9 };for (auto e : s){cout << e << " ";}cout << endl;//删除最小值s.erase(s.begin());for (auto e : s){cout << e << " ";}cout << endl;//直接删除xint x;/*cin >> x;int num = s.erase(x);if (num == 0){cout << x << "不存在! " << endl;}else{cout << x << "删除成功！" << endl;}*/cin >> x;auto pos = s.find(x);if (pos != s.end()){//pos失效s.erase(pos);//cout<<*pos<<endl;}else{cout << x << "不存在!" << endl;}for (auto e : s){cout << e << " ";}cout << endl;//算法库的查找O(N)auto pos1 = find(s.begin(), s.end(),x);// set自身实现的查找 O(logN)auto pos2 = s.find(x);// 利用count间接实现快速查找cin >> x;if (s.count(x)){cout << x << "在！" << endl;}else{cout << x << "不存在！" << endl;}return 0;
}

2.7 返回⼤于等val和等于val的使用样例

int main()
{std::set<int> myset;for (int i = 1; i < 10; i++)myset.insert(i * 10); // 10 20 30 40 50 60 70 80 90for (auto e : myset){cout << e << " ";}cout << endl;// [30, 50]值// [25, 55]值返回 >= 30//auto itlow = myset.lower_bound(30);返回 > 50//auto itup = myset.upper_bound(50);//返回 >= 25auto itlow = myset.lower_bound(25);//返回 > 55auto itup = myset.upper_bound(55);// 删除这段区间的值myset.erase(itlow, itup);for (auto e : myset){cout << e << " ";}cout << endl;return 0;
}

3.multiset和set的差异

multiset和set的使⽤基本完全类似，主要区别点在于multiset⽀持值冗余，那么 insert/find/count/erase都围绕着⽀持值冗余有所差异，具体参看下⾯的样例代码理解。

int main()
{// 相比set不同的是，multiset是排序，但是不去重multiset<int> s = { 4,2,7,2,4,8,4,5,4,9 };auto it = s.begin();while (it != s.end()){cout << *it << " ";++it;}cout << endl;// 相比set不同的是，x可能会存在多个，find查找中序的第一个int x;cin >> x;auto pos = s.find(x);while (pos != s.end() && *pos == x){cout << *pos << " ";++pos;}cout << endl;// 相比set不同的是，count会返回x的实际个数cout << s.count(x) << endl;去重//pos = s.find(x);//while (pos != s.end() && *pos == x)//{//	pos = s.erase(pos);//}//cout << endl;// 相⽐set不同的是，erase给值时会删除所有的xs.erase(x);//it = s.begin();while (it != s.end()){cout << *it << " ";++it;}cout << endl;return 0;
}

运行结果：

3.1 两个数组的交集

我们来写个题试试

两个数组的交集

题目描述：

我们可以把数据放到set里面去因为set相同的值会插入失败，这样不就去重了，然后进行比较过程中让小的++

代码如下：

class Solution {
public:vector<int> intersection(vector<int>& nums1, vector<int>& nums2) {set<int> set1 (nums1.begin(),nums1.end());set<int> set2 (nums2.begin(),nums2.end());vector<int> ret;auto it1 = set1.begin();auto it2 = set2.begin();while(it1 != set1.end() && it2 != set2.end()){if(*it1 > *it2){++it2;}else if(*it2 > *it1){++it1;}else{ret.push_back(*it1);it1++;it2++;}}return ret;}
};

3.2环形列表 ||

环形列表 ||

数据结构初阶阶段，我们通过证明⼀个指针从头开始⾛⼀个指针从相遇点开始⾛，会在⼊⼝点相遇，理解证明都会很⿇烦。这⾥我们使⽤set查找记录解决⾮常简单⽅便，这⾥体现了set在解决⼀些问题时的价值，完全是降维打击。

思路：这里我们先遍历一遍，遍历过程中把数据插入到set的容器变量中然后用count来统计个数如果个数为0则就插入当第二次入环的时候数据已经插入进去了此时的节点就是入环节点直接返回

代码如下：

/*** Definition for singly-linked list.* struct ListNode {*     int val;*     ListNode *next;*     ListNode(int x) : val(x), next(NULL) {}* };*/
class Solution {
public:ListNode *detectCycle(ListNode *head) {set<ListNode*>s;ListNode*cur = head;while(cur){if(s.count(cur)){return cur;}else{s.insert(cur);}cur = cur->next;}return nullptr;}
};

4. map系列的使用

4.1map和multimap参考⽂档

map - C++ Reference

4.2 map类的介绍

map的声明如下，Key就是map底层关键字的类型，T是map底层value的类型，set默认要求Key⽀持⼩于⽐较，如果不⽀持或者需要的话可以⾃⾏实现仿函数传给第⼆个模版参数，map底层存储数据的内存是从空间配置器申请的。⼀般情况下，我们都不需要传后两个模版参数。map底层是⽤红⿊树实现，增删查改效率是 O(logN) ，迭代器遍历是⾛的中序，所以是按key有序顺序遍历的。

4.3pair类型介绍

typedef pair<const Key, T> value_type;
template <class T1, class T2>
struct pair
{typedef T1 first_type;typedef T2 second_type;T1 first;T2 second;pair() : first(T1()), second(T2()){}pair(const T1& a, const T2& b) : first(a), second(b){}template<class U, class V>pair(const pair<U, V>& pr) : first(pr.first), second(pr.second){}
};
template <class T1, class T2>
inline pair<T1, T2> make_pair(T1 x, T2 y)
{return (pair<T1, T2>(x, y));
}

4.4map的构造

map的构造我们关注以下⼏个接⼝即可。

map的⽀持正向和反向迭代遍历，遍历默认按key的升序顺序，因为底层是⼆叉搜索树，迭代器遍历⾛的中序；⽀持迭代器就意味着⽀持范围for，map⽀持修改value数据，不⽀持修改key数据，修改关键字数据，破坏了底层搜索树的结构。

// empty (1) ⽆参默认构造
explicit map(const key_compare& comp = key_compare(),
   const allocator_type& alloc = allocator_type());

// range (2) 迭代器区间构造
template <class InputIterator>
map(InputIterator first, InputIterator last,
   const key_compare& comp = key_compare(),
   const allocator_type & = allocator_type());

// copy (3) 拷⻉构造
map(const map& x);

// initializer list (5) initializer 列表构造
map(initializer_list<value_type> il,
   const key_compare& comp = key_compare(),
   const allocator_type& alloc = allocator_type());

// 迭代器是⼀个双向迭代器
iterator->a bidirectional iterator to const value_type

// 正向迭代器
iterator begin();
iterator end();
// 反向迭代器

reverse_iterator rbegin();
reverse_iterator rend();

4.5 map的增删查

map的增删查关注以下⼏个接⼝即可：

map增接⼝，插⼊的pair键值对数据，跟set所有不同，但是查和删的接⼝只⽤关键字key跟set是完全类似的，不过find返回iterator，不仅仅可以确认key在不在，还找到key映射的value，同时通过迭代还可以修改value

Member types
key_type->The first template parameter(Key)
mapped_type->The second template parameter(T)
value_type->pair<const key_type, mapped_type>

// 单个数据插⼊，如果已经key存在则插⼊失败,key存在相等value不相等也会插⼊失败
pair<iterator, bool> insert(const value_type& val);

// 列表插⼊，已经在容器中存在的值不会插⼊
void insert(initializer_list<value_type> il);

// 迭代器区间插⼊，已经在容器中存在的值不会插⼊
template <class InputIterator>
void insert(InputIterator first, InputIterator last);

// 查找k，返回k所在的迭代器，没有找到返回end()
iterator find(const key_type& k);

// 查找k，返回k的个数
size_type count(const key_type& k) const;

// 删除⼀个迭代器位置的值
iterator erase(const_iterator position);

// 删除k，k存在返回0，存在返回1
size_type erase(const key_type& k);

// 删除⼀段迭代器区间的值
iterator erase(const_iterator first, const_iterator last);

// 返回⼤于等k位置的迭代器
iterator lower_bound(const key_type& k);

// 返回⼤于k位置的迭代器
const_iterator lower_bound(const key_type& k) const;

4.6 map的数据修改

前⾯我提到map⽀持修改mapped_type 数据，不⽀持修改key数据，修改关键字数据，破坏了底层搜索树的结构。

map第⼀个⽀持修改的⽅式时通过迭代器，迭代器遍历时或者find返回key所在的iterator修改，map 还有⼀个⾮常重要的修改接⼝operator[]，但是operator[]不仅仅⽀持修改，还⽀持插⼊数据和查找数据，所以他是⼀个多功能复合接⼝

需要注意从内部实现⻆度，map这⾥把我们传统说的value值，给的是T类型，typedef为mapped_type。⽽value_type是红⿊树结点中存储的pair键值对值。⽇常使⽤我们还是习惯将这⾥的 T映射值叫做value。

Member types
key_type->The first template parameter(Key)
mapped_type->The second template parameter(T)
value_type->pair<const key_type, mapped_type>
// 查找k，返回k所在的迭代器，没有找到返回end()，如果找到了通过iterator可以修改key对应的
mapped_type值
iterator find(const key_type& k);
// ⽂档中对insert返回值的说明
// The single element versions (1) return a pair, with its member pair::first
set to an iterator pointing to either the newly inserted element or to the
element with an equivalent key in the map.The pair::second element in the pair
is set to true if a new element was inserted or false if an equivalent key
already existed.

// insert插⼊⼀个pair<key, T>对象
// 1、如果key已经在map中，插⼊失败，则返回⼀个pair<iterator,bool>对象，返回pair对象
first是key所在结点的迭代器，second是false
// 2、如果key不在在map中，插⼊成功，则返回⼀个pair<iterator,bool>对象，返回pair对象
first是新插⼊key所在结点的迭代器，second是true
// 也就是说⽆论插⼊成功还是失败，返回pair<iterator,bool>对象的first都会指向key所在的迭
代器
// 那么也就意味着insert插⼊失败时充当了查找的功能，正是因为这⼀点，insert可以⽤来实现operator[]
// 需要注意的是这⾥有两个pair，不要混淆了，⼀个是map底层红⿊树节点中存的pair<key, T>，另
⼀个是insert返回值pair<iterator, bool>
pair<iterator, bool> insert(const value_type & val);
mapped_type& operator[] (const key_type& k);

// operator的内部实现
mapped_type& operator[] (const key_type& k)
{
   // 1、如果k不在map中，insert会插⼊k和mapped_type默认值，同时[]返回结点中存储
   mapped_type值的引⽤，那么我们可以通过引⽤修改返映射值。所以[]具备了插⼊ + 修改功能
       // 2、如果k在map中，insert会插⼊失败，但是insert返回pair对象的first是指向key结点的
       迭代器，返回值同时[]返回结点中存储mapped_type值的引⽤，所以[]具备了查找 + 修改的功能
       pair<iterator, bool> ret = insert({ k, mapped_type() });
   iterator it = ret.first;
   return it->second;
}

4.7 构造遍历及增删查使用样例

#include<iostream>
#include<map>
using namespace std;
int main()
{// initializer_list构造及迭代遍历map<string, string> dict = { {"left", "左边"}, {"right", "右边"},{"insert", "插入"},{ "string", "字符串" } };//map<string, string>::iterator it = dict.begin();auto it = dict.begin();while (it != dict.end()){//cout << (*it).first <<":"<<(*it).second << endl;// map的迭代基本都使⽤operator->,这⾥省略了⼀个->// 第⼀个->是迭代器运算符重载，返回pair*，第⼆个箭头是结构指针解引⽤取pair数据//cout << it.operator->()->first << ":" << it.operator->()-> second << endl;cout << it->first << ":" << it->second << endl;++it;}cout << endl;// insert插⼊pair对象的4种⽅式，对⽐之下，最后⼀种最⽅便pair<string, string> kv1("first", "第一个");dict.insert(kv1);dict.insert(pair<string, string>("second", "第二个"));dict.insert(make_pair("sort", "排序"));dict.insert({ "auto", "自动的" });// "left"已经存在，插⼊失败dict.insert({ "left", "左边，剩余" });// 范围for遍历for (const auto& e : dict){cout << e.first << ":" << e.second << endl;}cout << endl;string str;while (cin >> str){auto ret = dict.find(str);if (ret != dict.end()){cout << "->" << ret->second << endl;}else{cout << "无此单词，请重新输入" << endl;}}// erase等接⼝跟set完全类似，这⾥就不演⽰讲解了return 0;
}

运行结果：

4.8 map的迭代器和[]功能样例：

#include<iostream>
#include<map>
#include<string>
using namespace std;
int main()
{// 利⽤find和iterator修改功能，统计⽔果出现的次数string arr[] = { "苹果", "西瓜", "苹果", "西瓜", "苹果", "苹果", "西瓜","苹果", "香蕉", "苹果", "香蕉" };map<string, int> countMap;for (const auto& str : arr){// 先查找⽔果在不在map中// 1、不在，说明⽔果第⼀次出现，则插⼊{⽔果, 1}// 2、在，则查找到的节点中⽔果对应的次数++auto ret = countMap.find(str);if (ret == countMap.end()){countMap.insert({ str, 1 });}else{ret->second++;}}for (const auto& e : countMap){cout << e.first << ":" << e.second << endl;}cout << endl;return 0;

运行结果：

还可以这样

#include<iostream>
#include<map>
#include<string>
using namespace std;
int main()
{// 利⽤[]插⼊+修改功能，巧妙实现统计⽔果出现的次数string arr[] = { "苹果", "西⽠", "苹果", "西⽠", "苹果", "苹果", "西⽠","苹果", "⾹蕉", "苹果", "⾹蕉" };map<string, int> countMap;for (const auto& str : arr){// []先查找⽔果在不在map中// 1、不在，说明⽔果第⼀次出现，则插⼊{⽔果, 0}，同时返回次数的引⽤，++⼀下就变成1次了// 2、在，则返回⽔果对应的次数++countMap[str]++;}for (const auto& e : countMap){cout << e.first << ":" << e.second << endl;}cout << endl;return 0;
}

运行结果：

#include<iostream>
#include<map>
#include<string>
using namespace std;
int main()
{map<string, string> dict;dict.insert(make_pair("sort", "排序"));// key不存在->插⼊ {"insert", string()}dict["insert"];// 插⼊+修改dict["left"] = "左边";// 修改dict["left"] = "左边、剩余";// key存在->查找cout << dict["left"] << endl;return 0;
}

运行结果：

5.multimap和map的差异

multimap和map的使⽤基本完全类似，主要区别点在于multimap⽀持关键值key冗余，那么 insert/find/count/erase都围绕着⽀持关键值key冗余有所差异，这⾥跟set和multiset完全⼀样，⽐如 find时，有多个key，返回中序第⼀个。其次就是multimap不⽀持[]，因为⽀持key冗余，[]就只能⽀持插⼊了，不能⽀持修改。

5.1 随机链表的复制

随机链表的复制

题目描述：

数据结构初阶阶段，为了控制随机指针，我们将拷⻉结点链接在原节点的后⾯解决，后⾯拷⻉节点还得解下来链接，⾮常⿇烦。这⾥我们直接让{原结点,拷⻉结点}建⽴映射关系放到map中，控制随机指针会⾮常简单⽅便，这⾥体现了map在解决⼀些问题时的价值，完全是降维打击。

代码如下：

/*
// Definition for a Node.
class Node {
public:int val;Node* next;Node* random;Node(int _val) {val = _val;next = NULL;random = NULL;}
};
*/class Solution {
public:Node* copyRandomList(Node* head) {map<Node*,Node*>nodeMap;Node*copyhead=nullptr,*copytaill=nullptr;Node*cur = head;while(cur){if(copytaill == nullptr){copyhead = copytaill = new Node(cur->val);}else{copytaill->next = new Node (cur->val);copytaill = copytaill->next;}// 原节点和拷⻉节点map kv存储nodeMap[cur] = copytaill;cur = cur->next;}// 处理randomcur = head;Node* copy = copyhead;while(cur){if(nodeMap[cur->random] == nullptr){copy->random = nullptr;}else{copy->random = nodeMap[cur->random];}cur = cur->next;copy = copy->next;}return copyhead;}
};

5.2 692.前K个高频单词

692.前K个高频单词

题目描述：

本题⽬我们利⽤map统计出次数以后，返回的答案应该按单词出现频率由⾼到低排序，有⼀个特殊要求，如果不同的单词有相同出现频率，按字典顺序排序。

解决思路1：⽤排序找前k个单词，因为map中已经对key单词排序过，也就意味着遍历map时，次数相同的单词，字典序⼩的在前⾯，字典序⼤的在后⾯。那么我们将数据放到vector中⽤⼀个稳定的排序就可以实现上⾯特殊要求，但是sort底层是快排，是不稳定的，所以我们要⽤stable_sort，他是稳定的。

代码如下：

class Solution {
public:struct Compare{bool operator()(const pair<string, int>& x, const pair<string, int>& y)const{return x.second > y.second;}};vector<string> topKFrequent(vector<string>& words, int k) {map<string, int> countMap;for (auto& e : words){countMap[e]++;}vector<pair<string, int>> v(countMap.begin(), countMap.end());// 仿函数控制降序stable_sort(v.begin(), v.end(), Compare());//sort(v.begin(), v.end(), Compare());// 取前k个vector<string> strV;for (int i = 0; i < k; ++i){strV.push_back(v[i].first);}return strV;}
};

解决思路2：

将map统计出的次数的数据放到vector中排序，或者放到priority_queue中来选出前k个。利⽤仿函数强⾏控制次数相等的，字典序⼩的在前⾯。

class Solution {
public:struct Compare{bool operator()(const pair<string, int>& x, const pair<string, int>& y)const{return x.second > y.second || (x.second == y.second && x.first <y.first);;}};vector<string> topKFrequent(vector<string>& words, int k) {map<string, int> countMap;for (auto& e : words){countMap[e]++;}vector<pair<string, int>> v(countMap.begin(), countMap.end());// 仿函数控制降序,仿函数控制次数相等，字典序⼩的在前⾯sort(v.begin(), v.end(), Compare());// 取前k个vector<string> strV;for (int i = 0; i < k; ++i){strV.push_back(v[i].first);}return strV;}
};

用优先级队列

class Solution {
public:struct Compare{bool operator()(const pair<string, int>& x, const pair<string, int>& y)const{// 要注意优先级队列底层是反的，⼤堆要实现⼩于⽐较，所以这⾥次数相等，想要字典序⼩的在前⾯要⽐较字典序⼤的为真return x.second < y.second || (x.second == y.second && x.first >y.first);}};vector<string> topKFrequent(vector<string>& words, int k) {map<string, int> countMap;for (auto& e : words){countMap[e]++;}// 将map中的<单词，次数>放到priority_queue中，仿函数控制⼤堆，次数相同按照字典序规则排序priority_queue<pair<string, int>, vector<pair<string, int>>, Compare>p(countMap.begin(), countMap.end());vector<string> strV;for (int i = 0; i < k; ++i){strV.push_back(p.top().first);p.pop();}return strV;}
};

结束语：

总的来说，map 和 set 是 C++ STL 中非常强大且高效的容器，它们通过基于红黑树的实现保证了数据的有序性和操作的高效性。在处理需要频繁插入、查找和删除的任务时，map 和 set 提供了理想的解决方案。

无论是存储键值对的 map，还是存储唯一元素的 set，它们的时间复杂度始终保持在 O(log n)，适用于很多实际应用场景，如字典、集合运算、频率计数等。

最后感谢大家的支持