Python - 深夜数据结构与算法之 LRUCache

一.引言

二.LRU Cache 简介

1.实现特性

2.工作流程

三.LRU Cache 实战

1.HashMap + ListNode

2.OrderedDict

四.总结

一.引言

LRU 即 Least Recently Used 意为最近使用，它是一种局部 Cache 的缓存方法，用于存储最近使用的元素，随着 Cache 中元素的增加，LRU Cache 会逐步挪去结尾的近期未使用的元素。

二.LRU Cache 简介

1.实现特性

LRU Cache 一般由两个要素决定，大小与替换策略。

- 大小 即 Cache 的容量，如果 Cache 非常大，像内存一样，那我们直接一直存就可以了

- 替换 由于 Cache 一般不会无限制的增加，所以达到容量时，就需要根据策略进行替换删除元素

其实现一般通过 HashMap + Double LinkedList 即 Map + 双端链表实现。

2.工作流程

假设 Cache 的容量为 5，前面 A-E 正常记录 Cache 缓存，而当 F 插入时，根据 LRU 即最近最少使用的原则，排在最早且未被再次使用的 A 会被移出 Cache，而当 C 再次插入时，其会从 -2 的索引提到最前方，此时不涉及元素出链表，只是修改其顺序，后面以此类推。

除了 LRU 外，还有 LFU 其全称为 Least Frequently Used 代表最近使用频次最小，此时需要维护 Cache 内每个元素的使用频次，这里我们介绍 LRU，所以 LFU 就简单带过一下。完整的替换算法大家有兴趣可以 🪜 看一下 Wiki: Cache_replacement_policies。

三.LRU Cache 实战

实现: https://leetcode.cn/problems/lru-cache/description/

◆ 题目分析

实现容量为 Capacity 的 LRU Cache，基于前面的简介，下面我们分别使用 Python 库函数和手动双端队列实现。

1.HashMap + ListNode

class ListNode:# 双向链表def __init__(self, key=None, value=None):self.key = keyself.value = valueself.pre = Noneself.next = Noneclass LRUCache(object):def __init__(self, capacity):self.capacity = capacityself.hashmap = {}# 新建 head/tail 节点self.head = ListNode()self.tail = ListNode()# 初始化self.head.next = self.tailself.tail.pre = self.head# get put 都可能需要把元素一到末尾，所以添加辅助方法def move_node_to_end(self, key):# 获取当前节点node = self.hashmap[key]# Pre -> Node -> Next  =>  Pre <-> Nextnode.pre.next = node.nextnode.next.pre = node.pre# pre -> tail  => pre <-> node <-> tailnode.pre = self.tail.prenode.next = self.tailself.tail.pre.next = nodeself.tail.pre = nodedef get(self, key):""":type key: int:rtype: int"""# 在链表就移到末尾if key in self.hashmap:self.move_node_to_end(key)res = self.hashmap.get(key, -1)if res == -1:return reselse:return res.value def put(self, key, value):""":type key: int:type value: int:rtype: None"""if key in self.hashmap:# 无需添加元素，但是需要更新 value 并移动self.hashmap[key].value = valueself.move_node_to_end(key)else:cur_capacity = len(self.hashmap)if cur_capacity == self.capacity:self.hashmap.pop(self.head.next.key)# 更新前面的元素前后指针self.head.next = self.head.next.nextself.head.next.pre = self.head# 容量支持插入新元素new = ListNode(key, value)self.hashmap[key] = newnew.pre = self.tail.prenew.next = self.tailself.tail.pre.next = newself.tail.pre = new# Your LRUCache object will be instantiated and called as such:
# obj = LRUCache(capacity)
# param_1 = obj.get(key)
# obj.put(key,value)

代码比较长，但是都是基础的双端链表增删节点的操作，大家可以画示意图查看，更加直观。

2.OrderedDict

class LRUCache(collections.OrderedDict):def __init__(self, capacity: int):super().__init__()self.capacity = capacitydef get(self, key: int) -> int:if key not in self:return -1self.move_to_end(key)return self[key]def put(self, key: int, value: int) -> None:if key in self:self.move_to_end(key)self[key] = valueif len(self) > self.capacity:self.popitem(last=False)

Collections 的 OrderedDict 默认包含了上面的功能，如果是在工作中可以这么调用。