Ubuntu Python正则表达式使用指南

在Ubuntu系统中使用Python正则表达式，通常涉及以下步骤：

导入re模块：
```
import re
```
编译正则表达式（可选但推荐）：
使用re.compile()函数可以预编译正则表达式，提高匹配效率。
```
pattern = re.compile(r'\d+')  # 编译一个匹配一个或多个数字的模式
```

匹配字符串：
使用match()方法从字符串开头尝试匹配。

match_result = pattern.match('123abc')
if match_result:
    print(match_result.group())  # 输出: 123

搜索字符串：
使用search()方法在整个字符串中搜索匹配项。

search_result = pattern.search('123abc456')
if search_result:
    print(search_result.group())  # 输出: 123

查找所有匹配项：
使用findall()方法返回所有匹配的子串列表。

all_matches = pattern.findall('The numbers are 123, 456, and 789.')
print(all_matches)  # 输出: ['123', '456', '789']

替换文本：
使用sub()方法替换匹配项。

replaced_text = re.sub(r'\d+', 'NUMBER', 'My phone number is 1234567890.')
print(replaced_text)  # 输出: My phone number is NUMBER

分割字符串：
使用split()方法根据正则表达式分割字符串。

split_result = re.split(r'\s+', 'Hello world Python')
print(split_result)  # 输出: ['Hello', 'world', 'Python']

使用特殊字符和元字符：
正则表达式中有许多特殊字符和元字符，如.表示任意字符，*表示匹配零次或多次等。

# 匹配任意字符
pattern = re.compile('.')
print(pattern.match('abc').group())  # 输出: a

# 匹配一个点
pattern = re.compile('\\.')
print(pattern.match('abc.').group())  # 输出: .

分组和捕获：
使用圆括号()创建分组，可以捕获匹配的子串。

pattern = re.compile(r'(\d+)-(\d+)')
match = pattern.search('The numbers are 123-456.')
if match:
    print(match.group(1))  # 输出: 123
    print(match.group(2))  # 输出: 456

使用编译标志：
如re.IGNORECASE进行不区分大小写的匹配。

pattern = re.compile(r'hello', re.IGNORECASE)
result = pattern.search('HELLO world')
if result:
    print('匹配成功')  # 输出: 匹配成功