Rust 异步缓存系统的设计与实现 | 极客日志

Rust算法

Rust 异步缓存系统的设计与实现

Rust 异步缓存系统利用 Tokio 运行时和 Arc+Mutex 实现并发安全。文章涵盖 LRU/TTL 策略设计、基础数据结构实现、多服务集成（用户/订单/监控）及性能优化（原子操作/批量处理）。同时提供缓存穿透、击穿、雪崩的解决方案，如布隆过滤器与互斥锁机制，助力构建高可靠缓存架构。

链路追踪发布于 2026/3/29更新于 2026/6/1719 浏览

Rust 异步缓存系统的设计与实现

引言

缓存是现代 Web 应用架构中的核心组件，能够显著提升系统的性能和响应速度。通过将频繁访问的数据存储在高速缓存中，可以减少对数据库或外部 API 的请求，从而降低延迟和提高吞吐量。Rust 语言的异步特性和内存安全保障使得它非常适合用于构建高性能、可靠的异步缓存系统。

本文将深入探讨异步缓存系统的设计与实现，包括缓存策略、数据结构选择、并发安全保障、内存管理、错误处理和过期机制等方面。我们还将通过实战项目演示如何在用户同步服务、订单处理服务和监控服务中使用异步缓存系统，以及如何优化缓存系统的性能。

异步缓存系统的核心概念

缓存策略

缓存策略决定了数据在缓存中的存储和淘汰方式，常见的缓存策略包括：

LRU（Least Recently Used）：最近最少使用策略，淘汰最近最少使用的数据。
LFU（Least Frequently Used）：最不经常使用策略，淘汰使用频率最低的数据。
FIFO（First In First Out）：先进先出策略，淘汰最早进入缓存的数据。
TTL（Time To Live）：存活时间策略，数据在缓存中存储一定时间后自动过期。

异步操作的特点

异步缓存系统的异步操作具有以下特点：

非阻塞性：异步操作不会阻塞线程，提高了系统的并发能力。
高吞吐量：异步操作可以同时处理多个请求，提高了系统的吞吐量。
资源利用率：异步操作可以更有效地利用 CPU 和内存资源。

并发安全

异步缓存系统需要处理多个任务同时访问共享数据的情况，因此需要确保并发安全。Rust 提供了多种并发安全的工具，如 Arc、Mutex、RwLock 和原子类型。

核心设计与实现

1. 并发安全设计

异步缓存系统需要确保多个任务同时访问共享数据时不会发生数据竞争。我们可以使用 Arc 与 Mutex 或 RwLock 来实现线程安全的共享。

use std::sync::Arc;
use tokio::sync::Mutex;
use std::collections::HashMap;

#[derive(Clone)]
pub struct Cache<K, V> {
    data: Arc<Mutex<HashMap<K, V>>>,
}

impl<K, V> Cache<K, V>
where
    K: std::hash::Hash + Eq + Clone,
    V: Clone,
{
    pub fn new() -> Self {
        Cache {
            data: Arc::new(Mutex::(HashMap::())),
        }
    }

       (&, key: K)  <V> {
          = .data.().;
        data.(&key).()
    }

       (&, key: K, value: V) {
          = .data.().;
        data.(key, value);
    }

       (&, key: K) {
          = .data.().;
        data.(&key);
    }
}

相关免费在线工具

加密/解密文本
使用加密算法（如AES、TripleDES、Rabbit或RC4）加密和解密文本明文。在线工具，加密/解密文本在线工具，online
Gemini 图片去水印
基于开源反向 Alpha 混合算法去除 Gemini/Nano Banana 图片水印，支持批量处理与下载。在线工具，Gemini 图片去水印在线工具，online
Base64 字符串编码/解码
将字符串编码和解码为其 Base64 格式表示形式即可。在线工具，Base64 字符串编码/解码在线工具，online
Base64 文件转换器
将字符串、文件或图像转换为其 Base64 表示形式。在线工具，Base64 文件转换器在线工具，online
Markdown转HTML
将 Markdown（GFM）转为 HTML 片段，浏览器内 marked 解析；与 HTML转Markdown 互为补充。在线工具，Markdown转HTML在线工具，online
HTML转Markdown
将 HTML 片段转为 GitHub Flavored Markdown，支持标题、列表、链接、代码块与表格等；浏览器内处理，可链接预填。在线工具，HTML转Markdown在线工具，online

use thiserror::Error;

#[derive(Error, Debug)]
pub enum CacheError {
    #[error("Key not found")]
    KeyNotFound,
    #[error("Invalid key")]
    InvalidKey,
    #[error("Cache operation failed")]
    OperationFailed,
    #[error(transparent)]
    Unexpected(#[from] anyhow::Error),
}

use std::sync::Arc;
use tokio::sync::Mutex;
use std::collections::HashMap;
use std::time::{Duration, SystemTime};
use tokio::time;

#[derive(Clone)]
pub struct CacheEntry<V> {
    value: V,
    expiration: SystemTime,
}

impl<V> CacheEntry<V> {
    pub fn new(value: V, ttl: Duration) -> Self {
        let expiration = SystemTime::now() + ttl;
        CacheEntry { value, expiration }
    }

    pub fn is_expired(&self) -> bool {
        SystemTime::now() > self.expiration
    }
}

#[derive(Clone)]
pub struct Cache<K, V> {
    data: Arc<Mutex<HashMap<K, CacheEntry<V>>>>,
    ttl: Duration,
}

impl<K, V> Cache<K, V>
where
    K: std::hash::Hash + Eq + Clone + Send + Sync,
    V: Clone + Send + Sync,
{
    pub fn new(ttl: Duration) -> Self {
        let cache = Cache {
            data: Arc::new(Mutex::new(HashMap::new())),
            ttl,
        };
        cache.start_cleanup_task();
        cache
    }

    fn start_cleanup_task(&self) {
        let data = self.data.clone();
        let ttl = self.ttl;
        tokio::spawn(async move {
            loop {
                time::sleep(ttl).await;
                let mut data = data.lock().await;
                data.retain(|_, entry| !entry.is_expired());
            }
        });
    }

    pub async fn get(&self, key: K) -> Option<V> {
        let mut data = self.data.lock().await;
        if let Some(entry) = data.get(&key) {
            if entry.is_expired() {
                data.remove(&key);
                None
            } else {
                Some(entry.value.clone())
            }
        } else {
            None
        }
    }

    pub async fn put(&self, key: K, value: V) {
        let mut data = self.data.lock().await;
        let entry = CacheEntry::new(value, self.ttl);
        data.insert(key, entry);
    }

    pub async fn remove(&self, key: K) {
        let mut data = self.data.lock().await;
        data.remove(&key);
    }
}

// user-sync-service/src/cache.rs
use crate::sync::{ThirdPartyUser, sync_users};
use crate::config::Config;
use async_cache::{Cache, CacheError};
use std::time::Duration;

pub struct UserCache {
    cache: Cache<i32, ThirdPartyUser>,
}

impl UserCache {
    pub fn new(ttl: Duration) -> Self {
        UserCache {
            cache: Cache::new(ttl),
        }
    }

    pub async fn get_user(&self, user_id: i32) -> Result<Option<ThirdPartyUser>, CacheError> {
        Ok(self.cache.get(user_id).await)
    }

    pub async fn put_user(&self, user_id: i32, user: ThirdPartyUser) -> Result<(), CacheError> {
        self.cache.put(user_id, user).await?;
        Ok(())
    }

    pub async fn remove_user(&self, user_id: i32) -> Result<(), CacheError> {
        self.cache.remove(user_id).await?;
        Ok(())
    }

    pub async fn sync_users(&self, config: &Config) -> Result<(), CacheError> {
        let third_party_users = sync_users(config).await?;
        for user in third_party_users {
            self.put_user(user.id, user).await?;
        }
        Ok(())
    }
}

// order-processing-service/src/cache.rs
use crate::process::{Order, Product};
use crate::config::Config;
use async_cache::{Cache, CacheError};
use std::time::Duration;

pub struct OrderCache {
    order_cache: Cache<i32, Order>,
    product_cache: Cache<i32, Product>,
}

impl OrderCache {
    pub fn new(order_ttl: Duration, product_ttl: Duration) -> Self {
        OrderCache {
            order_cache: Cache::new(order_ttl),
            product_cache: Cache::new(product_ttl),
        }
    }

    pub async fn get_order(&self, order_id: i32) -> Result<Option<Order>, CacheError> {
        Ok(self.order_cache.get(order_id).await)
    }

    pub async fn put_order(&self, order_id: i32, order: Order) -> Result<(), CacheError> {
        self.order_cache.put(order_id, order).await?;
        Ok(())
    }

    pub async fn remove_order(&self, order_id: i32) -> Result<(), CacheError> {
        self.order_cache.remove(order_id).await?;
        Ok(())
    }

    pub async fn get_product(&self, product_id: i32) -> Result<Option<Product>, CacheError> {
        Ok(self.product_cache.get(product_id).await)
    }

    pub async fn put_product(&self, product_id: i32, product: Product) -> Result<(), CacheError> {
        self.product_cache.put(product_id, product).await?;
        Ok(())
    }

    pub async fn remove_product(&self, product_id: i32) -> Result<(), CacheError> {
        self.product_cache.remove(product_id).await?;
        Ok(())
    }
}

// monitoring-service/src/cache.rs
use crate::monitor::{SystemState, PerformanceMetric};
use crate::config::Config;
use async_cache::{Cache, CacheError};
use std::time::Duration;

pub struct MonitoringCache {
    system_state_cache: Cache<String, SystemState>,
    performance_metric_cache: Cache<String, PerformanceMetric>,
}

impl MonitoringCache {
    pub fn new(system_state_ttl: Duration, performance_metric_ttl: Duration) -> Self {
        MonitoringCache {
            system_state_cache: Cache::new(system_state_ttl),
            performance_metric_cache: Cache::new(performance_metric_ttl),
        }
    }

    pub async fn get_system_state(&self, key: &str) -> Result<Option<SystemState>, CacheError> {
        Ok(self.system_state_cache.get(key.to_string()).await)
    }

    pub async fn put_system_state(&self, key: &str, state: SystemState) -> Result<(), CacheError> {
        self.system_state_cache.put(key.to_string(), state).await?;
        Ok(())
    }

    pub async fn remove_system_state(&self, key: &str) -> Result<(), CacheError> {
        self.system_state_cache.remove(key.to_string()).await?;
        Ok(())
    }

    pub async fn get_performance_metric(&self, key: &str) -> Result<Option<PerformanceMetric>, CacheError> {
        Ok(self.performance_metric_cache.get(key.to_string()).await)
    }

    pub async fn put_performance_metric(&self, key: &str, metric: PerformanceMetric) -> Result<(), CacheError> {
        self.performance_metric_cache.put(key.to_string(), metric).await?;
        Ok(())
    }

    pub async fn remove_performance_metric(&self, key: &str) -> Result<(), CacheError> {
        self.performance_metric_cache.remove(key.to_string()).await?;
        Ok(())
    }
}

use std::sync::atomic::{AtomicUsize, Ordering};

#[derive(Clone)]
pub struct CacheEntry<V> {
    value: V,
    expiration: SystemTime,
    access_count: AtomicUsize,
}

impl<V> CacheEntry<V> {
    pub fn new(value: V, ttl: Duration) -> Self {
        let expiration = SystemTime::now() + ttl;
        CacheEntry {
            value,
            expiration,
            access_count: AtomicUsize::new(0),
        }
    }

    pub fn increment_access_count(&self) {
        self.access_count.fetch_add(1, Ordering::Relaxed);
    }

    pub fn get_access_count(&self) -> usize {
        self.access_count.load(Ordering::Relaxed)
    }
}

pub async fn get_batch(&self, keys: Vec<K>) -> Vec<Option<V>> {
    let data = self.data.lock().await;
    keys.iter()
        .map(|key| {
            data.get(key)
                .filter(|entry| !entry.is_expired())
                .map(|entry| entry.value.clone())
        })
        .collect()
}

pub async fn put_batch(&self, items: Vec<(K, V)>) {
    let mut data = self.data.lock().await;
    for (key, value) in items {
        let entry = CacheEntry::new(value, self.ttl);
        data.insert(key, entry);
    }
}

use sqlx::PgPool;

pub struct DatabaseCache {
    pool: PgPool,
    cache: Cache<String, String>,
}

impl DatabaseCache {
    pub async fn new(url: &str, pool_size: u32, ttl: Duration) -> Result<Self, CacheError> {
        let pool = PgPool::connect_with(
            sqlx::postgres::PgConnectOptions::new()
                .url(url)
                .pool_options(sqlx::PoolOptions::new().max_connections(pool_size)),
        )
        .await?;
        Ok(DatabaseCache {
            pool,
            cache: Cache::new(ttl),
        })
    }
}

use bloomfilter::Bloom;

pub struct BloomFilterCache {
    bloom: Arc<Mutex<Bloom>>,
    cache: Cache<String, String>,
    db: sqlx::PgPool,
}

impl BloomFilterCache {
    pub async fn get(&self, key: &str) -> Result<Option<String>, CacheError> {
        let bloom = self.bloom.lock().await;
        if !bloom.contains(key) {
            return Ok(None);
        }
        drop(bloom);
        // ... logic to fetch from cache or db
        Ok(None)
    }
}

pub struct MutexCache {
    cache: Cache<String, String>,
    db: sqlx::PgPool,
    locks: Arc<Mutex<HashMap<String, tokio::sync::Mutex<()>>>>,
}

impl MutexCache {
    pub async fn get(&self, key: &str) -> Result<Option<String>, CacheError> {
        if let Some(value) = self.cache.get(key.to_string()).await {
            return Ok(Some(value));
        }
        let mut locks = self.locks.lock().await;
        let lock = locks.entry(key.to_string()).or_insert_with(|| tokio::sync::Mutex::new(()));
        drop(locks);
        let _guard = lock.lock().await;
        // Double check after acquiring lock
        if let Some(value) = self.cache.get(key.to_string()).await {
            return Ok(Some(value));
        }
        // Fetch from DB and update cache
        Ok(None)
    }
}

use rand::Rng;

impl<V> CacheEntry<V> {
    pub fn new(value: V, ttl: Duration) -> Self {
        let jitter = rand::thread_rng().gen_range(0..300) as u64;
        let expiration = SystemTime::now() + ttl + Duration::from_secs(jitter);
        CacheEntry { value, expiration }
    }
}

Rust 异步缓存系统的设计与实现

Rust 异步缓存系统的设计与实现

引言

异步缓存系统的核心概念

缓存策略

异步操作的特点

并发安全

核心设计与实现

1. 并发安全设计

更多推荐文章

相关免费在线工具

2. 内存管理与生命周期

3. 错误处理设计

4. 过期机制设计

实战项目集成

1. 用户同步服务的缓存集成

2. 订单处理服务的缓存集成

3. 监控服务的缓存集成

性能优化

1. 使用原子操作

2. 使用批量操作

3. 使用连接池

常见问题与解决方案

1. 缓存穿透

2. 缓存击穿

3. 缓存雪崩

总结

更多推荐文章

相关免费在线工具

Rust 异步缓存系统的设计与实现

Rust 异步缓存系统的设计与实现

引言

异步缓存系统的核心概念

缓存策略

异步操作的特点

并发安全

核心设计与实现

1. 并发安全设计

微信扫一扫，关注极客日志

更多推荐文章

相关免费在线工具

2. 内存管理与生命周期

3. 错误处理设计

4. 过期机制设计

实战项目集成

1. 用户同步服务的缓存集成

2. 订单处理服务的缓存集成

3. 监控服务的缓存集成

性能优化

1. 使用原子操作

2. 使用批量操作

3. 使用连接池

常见问题与解决方案

1. 缓存穿透

2. 缓存击穿

3. 缓存雪崩

总结

微信扫一扫，关注极客日志

更多推荐文章

相关免费在线工具