Python 正则表达式核心用法与实战示例 | 极客日志

Python算法

Python 正则表达式核心用法与实战示例

综述由AI生成Python 正则表达式的核心用法与实战技巧。内容涵盖基础匹配函数 search 与 findall，字符类如数字与小数匹配，量星与锚点的运用，以及分组捕获机制。同时讲解了 split 与 sub 函数的应用，并补充了原始字符串、预编译模式等最佳实践。文中提供了邮箱、手机号及 URL 提取等常见场景的代码示例，旨在帮助开发者高效解决文本处理问题。

DebugKing发布于 2025/2/7更新于 2026/6/218 浏览

Python 正则表达式核心用法与实战示例

字符串处理是编程中的基础且高频场景，正则表达式（Regular Expression）提供了简洁高效的文本匹配方案。Python 内置的 re 模块为开发者提供了完整的正则支持。本文将系统梳理 Python 正则表达式的常用语法、函数及实战技巧，帮助读者掌握这一核心工具。

1. 基础匹配函数

1.1 查找第一个匹配项

使用 re.search() 在字符串中查找第一个符合模式的位置。如果找到，返回一个 Match 对象；否则返回 None。

import re

s = 'i love python very much'
pattern = 'python'
result = re.search(pattern, s)
if result:
    print(f"匹配位置：{result.span()}")  # 输出：(7, 13)
    print(f"匹配内容：{result.group()}")  # 输出：python

1.2 查找所有匹配项

使用 re.finditer() 或 re.findall() 获取所有匹配结果。

s = '山东省潍坊市青州第 1 中学高三 1 班'
pattern = '1'
# finditer 返回迭代器，包含 Match 对象
for match in re.finditer(pattern, s):
    print(f"索引：{match.span()}, 内容：{match.group()}")
# 输出：
# 索引：(9, 10), 内容：1
# 索引：(14, 15), 内容：1

2. 字符类与量词

2.1 数字匹配

\d 代表任意数字（等价于 [0-9]），配合量词可匹配多位数。

s = '一共 20 行代码运行时间 13.59s'
pattern = r'\d+'  # \d+ 表示匹配一个或多个数字
matches = re.findall(pattern, s)
print(matches)

相关免费在线工具

加密/解密文本
使用加密算法（如AES、TripleDES、Rabbit或RC4）加密和解密文本明文。在线工具，加密/解密文本在线工具，online
Gemini 图片去水印
基于开源反向 Alpha 混合算法去除 Gemini/Nano Banana 图片水印，支持批量处理与下载。在线工具，Gemini 图片去水印在线工具，online
curl 转代码
解析常见 curl 参数并生成 fetch、axios、PHP curl 或 Python requests 示例代码。在线工具，curl 转代码在线工具，online
Base64 字符串编码/解码
将字符串编码和解码为其 Base64 格式表示形式即可。在线工具，Base64 字符串编码/解码在线工具，online
Base64 文件转换器
将字符串、文件或图像转换为其 Base64 表示形式。在线工具，Base64 文件转换器在线工具，online
Markdown转HTML
将 Markdown（GFM）转为 HTML 片段，浏览器内 marked 解析；与 HTML转Markdown 互为补充。在线工具，Markdown转HTML在线工具，online

s = '一共 20 行代码运行时间 13.59s'
pattern = r'\d+\.?\d*'  # ? 表示前一个字符出现 0 次或 1 次
matches = re.findall(pattern, s)
print(matches)  # ['20', '13.59']

s = 'This module provides regular expression matching operations similar to those found in Perl'
pattern = r'^[emrt]'  # 查找以 e, m, r, t 开头的单词
matches = re.findall(pattern, s)
print(matches)  # []，因为首字母是 T

s = 'This module provides regular expression matching operations similar to those found in Perl'
pattern = r'^[emrt]'
# 编译时添加 IGNORECASE 标志
compiled = re.compile(pattern, re.I)
result = compiled.search(s)
if result:
    print(result.group())  # 输出：T

s = 'This module provides regular expression matching operations similar to those found in Perl'
pattern = r'\s([a-zA-Z]+)'  # 捕获空格后的单词
matches = re.findall(pattern, s)
print(matches) 
# ['module', 'provides', 'regular', ...]

s = 'This module provides regular expression matching operations similar to those found in Perl'
pattern = r'\s?([a-zA-Z]+)'  # 空格可选
matches = re.findall(pattern, s)
print(matches) 
# ['This', 'module', 'provides', ...]

s = 'color red and color blue'
pattern = r'(?:red|blue)'
matches = re.findall(pattern, s)
print(matches)  # ['red', 'blue']

s = 'This module provides regular expression matching operations similar to those found in Perl'
pattern = r'\s+'
words = re.split(pattern, s)
print(words)  # ['This', 'module', 'provides', ...]

s = '价格：100 元，折扣后：80 元'
pattern = r'(\d+) 元'
def replace_price(match):
    price = int(match.group(1))
    return f'{price * 0.8}元'

result = re.sub(pattern, replace_price, s)
print(result)  # 价格：80.0 元，折扣后：64.0 元

# 推荐
pattern = r'\d+'
# 不推荐（需要双反斜杠）
pattern = '\\\d+'

import re
pattern = re.compile(r'\w+@\w+\.com')
result = pattern.search('[email protected]')

email_pattern = r'[a-zA-Z0-9_.+-]+@[a-zA-Z0-9-]+\.[a-zA-Z0-9-.]+'

phone_pattern = r'1[3-9]\d{9}'

url_pattern = r'https?://(?:www\.)?[a-zA-Z0-9.-]+'

Python 正则表达式核心用法与实战示例

Python 正则表达式核心用法与实战示例

1. 基础匹配函数

1.1 查找第一个匹配项

1.2 查找所有匹配项

2. 字符类与量词

2.1 数字匹配

更多推荐文章

相关免费在线工具

2.2 小数匹配

2.3 量词详解

3. 锚点与边界

3.1 开头与结尾

3.2 忽略大小写

4. 分组与捕获

4.1 提取单词

4.2 包含首单词

4.3 非捕获分组

5. 分割与替换

5.1 分割字符串

5.2 替换内容

6. 进阶技巧与最佳实践

6.1 原始字符串

6.2 预编译模式

6.3 常见实战模式

邮箱验证

手机号验证（中国大陆）

URL 提取

7. 总结

更多推荐文章

相关免费在线工具

Python 正则表达式核心用法与实战示例

Python 正则表达式核心用法与实战示例

1. 基础匹配函数

1.1 查找第一个匹配项

1.2 查找所有匹配项

2. 字符类与量词

2.1 数字匹配

微信扫一扫，关注极客日志

更多推荐文章

相关免费在线工具

2.2 小数匹配

2.3 量词详解

3. 锚点与边界

3.1 开头与结尾

3.2 忽略大小写

4. 分组与捕获

4.1 提取单词

4.2 包含首单词

4.3 非捕获分组

5. 分割与替换

5.1 分割字符串

5.2 替换内容

6. 进阶技巧与最佳实践

6.1 原始字符串

6.2 预编译模式

6.3 常见实战模式

邮箱验证

手机号验证（中国大陆）

URL 提取

7. 总结

微信扫一扫，关注极客日志

更多推荐文章

相关免费在线工具