Optimizing Rejection Sampling in Python: A Hands-On Multimodal Distribution Simulation

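Rejection sampling is a Monte Carlo method for drawing samples from a complex distribution, and it is also one of the key techniques used in LLM fine-tuning. This article uses Python simulations to compare several rejection sampling strategies, focusing on how the choice of proposal distribution and parameters affects sampling efficiency.

Core Principle and Optimization Directions

Rejection sampling draws a candidate from a proposal distribution and then accepts or rejects it with a certain probability. The key constraint is that a constant M must exist such that target_pdf(x) <= M * proposal_pdf(x) for every x. A candidate x is accepted with probability target_pdf(x) / (M * proposal_pdf(x)); when both densities are normalized, the overall acceptance rate is 1/M, so a tighter M means fewer wasted draws.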
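
To make the envelope condition concrete, here is a minimal sketch on a toy problem that is not part of the original article: a Beta(2, 5) target with a Uniform(0, 1) proposal, both chosen purely for illustration. It estimates the smallest valid M on a grid and checks empirically that the acceptance rate lands close to 1/M.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)

# Toy target (an assumption for illustration): Beta(2, 5) on [0, 1]
toy_target_pdf = stats.beta(2, 5).pdf
# Toy proposal: Uniform(0, 1), whose density is 1 everywhere on [0, 1]
toy_proposal_pdf = stats.uniform(0, 1).pdf

# The smallest valid M is the maximum of target/proposal, found here on a fine grid
grid = np.linspace(0.0, 1.0, 100001)
M = float(np.max(toy_target_pdf(grid) / toy_proposal_pdf(grid)))

# Draw candidates and apply the accept/reject test u < target / (M * proposal)
x = rng.uniform(0.0, 1.0, size=200_000)
u = rng.uniform(0.0, 1.0, size=x.size)
accepted = x[u < toy_target_pdf(x) / (M * toy_proposal_pdf(x))]

empirical_rate = accepted.size / x.size
print(f"M = {M:.4f}, 1/M = {1 / M:.4f}, empirical acceptance = {empirical_rate:.4f}")
```

A grid search only approximates the true maximum, so practical code usually multiplies the estimate by a small safety factor, at the cost of a slightly lower acceptance rate.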

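Common optimization strategies include:

Adaptive adjustment: update the proposal distribution's parameters from the samples accepted so far and recompute M accordingly.
Stratified sampling: split the domain into sub-regions, match each region with the most suitable proposal distribution, and balance the number of samples drawn per region.
Mixture proposals: use a weighted sum of several proposal distributions, which is particularly well suited to multimodal targets.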
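
The stratified idea in the list above can be sketched roughly as follows. This is one possible reading, not the article's implementation: the stratum edges, the uniform per-stratum proposal, the mass-proportional allocation, and the names bimodal_pdf and stratified_rejection_sampling are all assumptions made for illustration.

```python
import numpy as np
from scipy import stats, integrate

rng = np.random.default_rng(0)

def bimodal_pdf(x):
    # Illustrative bimodal density: a two-component Gaussian mixture
    return 0.7 * stats.norm.pdf(x, -2, 1) + 0.3 * stats.norm.pdf(x, 3, 1.5)

def stratified_rejection_sampling(pdf, edges, n_samples, safety=1.05):
    """Rejection-sample each stratum [a, b] with its own uniform proposal."""
    edges = np.asarray(edges, dtype=float)
    strata = list(zip(edges[:-1], edges[1:]))
    # Probability mass of each stratum; mass outside the edges is ignored
    masses = np.array([integrate.quad(pdf, a, b)[0] for a, b in strata])
    # Allocate the sample budget across strata in proportion to their mass
    counts = rng.multinomial(n_samples, masses / masses.sum())

    samples, attempts, accepted = [], 0, 0
    for (a, b), n_k in zip(strata, counts):
        # Local bound on the density, estimated on a grid with a safety margin.
        # With proposal 1/(b-a) and M_k = peak*(b-a), the test is u < pdf(x)/peak.
        peak = safety * np.max(pdf(np.linspace(a, b, 2001)))
        kept = 0
        while kept < n_k:
            x = rng.uniform(a, b, size=max(2 * (n_k - kept), 64))
            u = rng.uniform(0.0, 1.0, size=x.size)
            ok = x[u * peak < pdf(x)]
            attempts += x.size
            accepted += ok.size
            samples.append(ok[: n_k - kept])
            kept += min(ok.size, n_k - kept)
    return np.concatenate(samples), accepted / attempts

samples, rate = stratified_rejection_sampling(bimodal_pdf, [-8, -4, 0, 4, 10], n_samples=5000)
print(f"drew {samples.size} samples, acceptance rate = {rate:.3f}")
```

Strata that cover low-density tails with a flat proposal reject most candidates, so the choice of edges has a large effect on the overall acceptance rate.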

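Python Simulation

To show the effect of each strategy, we build a target model from a bimodal Gaussian mixture and implement four samplers: basic, adaptive, stratified, and mixture.

Environment Setup and Target Distribution

First import the scientific computing libraries and define the target probability density function. The target is a mixture of two Gaussians, with weight 0.7 on N(-2, 1) and weight 0.3 on N(3, 1.5), where the second parameter is the standard deviation: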

import numpy as np
import matplotlib.pyplot as plt
from scipy import stats
import time

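# Plot font: 'Arial Unicode MS' can render CJK labels (it may not be installed on every system)
plt.rcParams['font.family'] = 'Arial Unicode MS'

# Target distribution: a two-component Gaussian mixture
def target_pdf(x):
    return 0.7 * stats.norm.pdf(x, -2, 1) + 0.3 * stats.norm.pdf(x, 3, 1.5)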
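
As a quick sanity check that is not in the original article, the density can be integrated numerically to confirm that it is normalized and to read off its mean. The integration window [-15, 20] is an assumption, chosen wide enough that the tails are negligible.

```python
import numpy as np
from scipy import stats, integrate

def target_pdf(x):
    return 0.7 * stats.norm.pdf(x, -2, 1) + 0.3 * stats.norm.pdf(x, 3, 1.5)

# `points` tells quad where the two peaks are so that neither is missed
total_mass, _ = integrate.quad(target_pdf, -15, 20, points=[-2, 3])
mean, _ = integrate.quad(lambda x: x * target_pdf(x), -15, 20, points=[-2, 3])

print(f"total mass = {total_mass:.6f}, mean = {mean:.6f}")
```

Because the density integrates to 1, the 1/M relationship between the envelope constant and the acceptance rate applies to it directly.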

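Basic Rejection Sampling

This is the classic implementation. We need an M large enough that M * proposal_pdf(x) covers the target density everywhere. The method is simple, but in high-dimensional or multimodal settings the acceptance rate tends to be low.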

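def basic_rejection_sampling(target_pdf, proposal_pdf, proposal_rv, M, n_samples=1000, max_iter=10000):
    samples = []
    attempts = 0
    # Stop once enough samples are accepted or the attempt budget is exhausted
    while len(samples) < n_samples and attempts < max_iter:
        attempts += 1
        x = proposal_rv()              # candidate drawn from the proposal
        u = np.random.uniform(0, 1)

        # Accept x with probability target_pdf(x) / (M * proposal_pdf(x))
        if u < target_pdf(x) / (M * proposal_pdf(x)):
            samples.append(x)

    acceptance_rate = len(samples) / attempts if attempts > 0 else 0
    return np.array(samples), acceptance_rate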
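
The loop above evaluates the densities once per candidate, which is slow in pure Python. A batch variant is sketched below as a hedged alternative rather than the article's code: the function name vectorized_rejection_sampling, the N(0, 4) proposal, and the grid-based M with a 10% margin are all assumptions made for illustration.

```python
import numpy as np
from scipy import stats

def target_pdf(x):
    return 0.7 * stats.norm.pdf(x, -2, 1) + 0.3 * stats.norm.pdf(x, 3, 1.5)

def vectorized_rejection_sampling(target_pdf, proposal_dist, M, n_samples=1000,
                                  batch_size=4096, max_batches=1000, seed=0):
    """Batch variant: draw many candidates at once and filter with a boolean mask."""
    rng = np.random.default_rng(seed)
    accepted, n_accepted, attempts = [], 0, 0
    for _ in range(max_batches):
        x = proposal_dist.rvs(size=batch_size, random_state=rng)
        u = rng.uniform(0.0, 1.0, size=batch_size)
        keep = x[u < target_pdf(x) / (M * proposal_dist.pdf(x))]
        accepted.append(keep)
        n_accepted += keep.size
        attempts += batch_size
        if n_accepted >= n_samples:
            break
    # Trim the surplus from the last batch; the rate uses all candidates drawn
    samples = np.concatenate(accepted)[:n_samples]
    return samples, n_accepted / attempts

proposal = stats.norm(0, 4)  # hypothetical wide proposal, standard deviation 4
grid = np.linspace(-15, 15, 6001)
M = 1.1 * np.max(target_pdf(grid) / proposal.pdf(grid))

samples, rate = vectorized_rejection_sampling(target_pdf, proposal, M, n_samples=5000)
print(f"M = {M:.3f}, acceptance rate = {rate:.3f}, sample mean = {samples.mean():.3f}")
```

Passing a frozen scipy.stats distribution keeps the sampler and its density in one object, which avoids the two going out of sync.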

Adaptive Rejection Sampling

class AdaptiveRejectionSampler:
    def __init__(self, target_pdf, initial_proposal, domain=(-10, 10)):
        self.target_pdf = target_pdf
        self.proposal_pdf = initial_proposal['pdf']
        self.proposal_rv = initial_proposal['rv']
        self.domain = domain
        self.M_history = []

    def find_optimal_M(self, n_test=1000):
        # Estimate M as the largest target/proposal ratio on a grid, plus a 10% margin
        test_points = np.linspace(self.domain[0], self.domain[1], n_test)
        ratios = [self.target_pdf(x) / self.proposal_pdf(x) for x in test_points]
        return np.max(ratios) * 1.1 if ratios else 2.0

    def adaptive_sampling(self, n_samples=1000, adapt_every=100):
        samples = []
        total_attempts = 0
        M = self.find_optimal_M()
        self.M_history.append(M)

        for batch in range(0, n_samples, adapt_every):
            batch_size = min(adapt_every, n_samples - len(samples))
            batch_samples, batch_attempts = self._sample_batch(batch_size, M)
            samples.extend(batch_samples)
            total_attempts += batch_attempts

            # Update the proposal once a certain number of samples have been accepted
            if len(samples) > 10:
                self._update_proposal(samples)
                M = self.find_optimal_M()
                self.M_history.append(M)

        acceptance_rate = len(samples) / total_attempts if total_attempts > 0 else 0
        return np.array(samples), acceptance_rate

    def _sample_batch(self, batch_size, M):
        samples = []
        attempts = 0
        # Cap the attempts per batch at 100 times the requested batch size
        while len(samples) < batch_size and attempts < batch_size * 100:
            attempts += 1
            x = self.proposal_rv()
            u = np.random.uniform(0, 1)
            if u < self.target_pdf(x) / (M * self.proposal_pdf(x)):
                samples.append(x)
        return samples, attempts

    def _update_proposal(self, accepted_samples):
        # Refit a single Gaussian to the accepted samples; the +0.1 keeps std away from zero
        mean = np.mean(accepted_samples)
        std = np.std(accepted_samples) + 0.1
        self.proposal_pdf = lambda x: stats.norm.pdf(x, mean, std)
        self.proposal_rv = lambda: np.random.normal(mean, std)
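
The mixture-proposal strategy mentioned earlier is not shown in the text, so here is a minimal sketch of how it might look. The component weights, means, and standard deviations are illustrative assumptions rather than the article's settings, and the resulting acceptance rate should not be expected to match any figure the article reports.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)

def target_pdf(x):
    return 0.7 * stats.norm.pdf(x, -2, 1) + 0.3 * stats.norm.pdf(x, 3, 1.5)

# Hypothetical mixture proposal: one slightly widened Gaussian per target mode
WEIGHTS = np.array([0.6, 0.4])
MEANS = np.array([-2.0, 3.0])
STDS = np.array([1.3, 1.9])

def mixture_pdf(x):
    # Weighted sum of component densities; broadcasting gives shape (..., 2) before the sum
    x = np.asarray(x, dtype=float)
    return np.sum(WEIGHTS * stats.norm.pdf(x[..., None], MEANS, STDS), axis=-1)

def mixture_rvs(size):
    # Pick a component for each draw, then sample from that component
    comp = rng.choice(len(WEIGHTS), size=size, p=WEIGHTS)
    return rng.normal(MEANS[comp], STDS[comp])

# Envelope constant estimated on a grid, with a 5% safety margin
grid = np.linspace(-15, 15, 6001)
M = 1.05 * np.max(target_pdf(grid) / mixture_pdf(grid))

x = mixture_rvs(50_000)
u = rng.uniform(0.0, 1.0, size=x.size)
samples = x[u < target_pdf(x) / (M * mixture_pdf(x))]
rate = samples.size / x.size
print(f"M = {M:.3f}, acceptance rate = {rate:.3f}")
```

Each proposal component is wider than the target component it covers, which keeps the ratio target_pdf / mixture_pdf bounded in the tails.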

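Performance Comparison

Method	Acceptance rate	Time (s)
Basic rejection sampling	~0.176	0.65
Adaptive rejection sampling	~0.344	1.05
Stratified rejection sampling	~0.067	2.45
Mixture proposal	~0.461	0.49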
