一、数组创建基础方法

1.1 特殊值数组生成

在科学计算中，初始化特定值的数组是常见需求。NumPy提供了高效的生成接口：

全零数组

import numpy as np
# 指定形状生成全零数组
zeros_arr = np.zeros((3,4), dtype=np.float32)
# 输出：
# array([[0., 0., 0., 0.],
#        [0., 0., 0., 0.],
#        [0., 0., 0., 0.]], dtype=float32)
# 基于现有数组形状生成
template = np.ones((2,5))
zeros_like = np.zeros_like(template)

全一数组

ones_arr = np.ones((2,2,3))  # 支持多维数组
# 输出三维全一数组
# array([[[1., 1., 1.],
#          [1., 1., 1.]],
#         [[1., 1., 1.],
#          [1., 1., 1.]]])

1.2 数组复制技术对比

从现有数据创建数组时，需注意内存管理差异：

array()方法

总是创建新数组副本

适用于不同数据类型转换

original = [[1,2],[3,4]]
copy1 = np.array(original, dtype=np.float64)  # 显式类型转换

asarray()方法

仅在必要时创建副本

对已为ndarray的输入不复制

arr = np.arange(5)
copy2 = np.asarray(arr)  # 不创建新副本
copy2[0] = 100
print(arr[0])  # 输出100，原数组被修改

二、序列数组生成技术

2.1 等差序列生成

linspace()参数详解

# 生成0-2π之间的10个等分点
angles = np.linspace(0, 2*np.pi, 10, endpoint=False)
# endpoint=False时不包含终点

arange()步长控制

# 生成0-10(不含)的奇数序列
odd_numbers = np.arange(0, 10, 2)
# 输出：array([0, 2, 4, 6, 8])

2.2 等比序列生成

logspace()特别适合生成对数坐标数据：

# 生成10^0到10^3之间的5个对数间隔数
powers = np.logspace(0, 3, 5)
# 输出：array([   1.    ,    5.62341325,   31.6227766 ,  177.827941  , 1000.    ])

三、随机数生成体系

3.1 核心随机模块

np.random子模块提供全面的随机数生成能力：

基础随机数

# 生成[0,1)区间均匀分布随机数
rand_nums = np.random.rand(2,3)
# 生成标准正态分布随机数
normal_nums = np.random.randn(5)

3.2 正态分布深度解析

参数控制技巧

# 生成μ=5, σ=2的正态分布样本
mu, sigma = 5, 2
samples = np.random.normal(mu, sigma, 1000)
# 验证均值和标准差
print(f"实际均值: {np.mean(samples):.2f}")
print(f"实际标准差: {np.std(samples):.2f}")

可视化验证

import matplotlib.pyplot as plt
plt.hist(samples, bins=30, density=True)
plt.show()  # 应呈现钟形曲线

3.3 高级随机分布

均匀分布

# 生成[a,b)区间均匀分布
uniform_nums = np.random.uniform(low=-1, high=1, size=1000)

整数随机数

# 生成1-6的随机整数（模拟骰子）
dice_rolls = np.random.randint(1, 7, size=100)

四、数组运算进阶技巧

4.1 广播机制应用

# 矩阵与向量相加（自动广播）
matrix = np.array([[1,2,3],[4,5,6]])
vector = np.array([10,20,30])
result = matrix + vector
# 输出：
# array([[11, 22, 33],
#        [14, 25, 36]])

4.2 条件索引操作

# 生成随机数组并筛选大于0.5的元素
arr = np.random.rand(5,5)
mask = arr > 0.5
filtered = arr[mask]  # 返回一维数组

4.3 数学函数应用

# 逐元素数学运算
x = np.array([1,4,9])
sqrt_x = np.sqrt(x)  # 平方根
exp_x = np.exp(x)    # 指数运算
log_x = np.log(x)    # 自然对数

五、性能优化实践

5.1 内存预分配

对于大规模数组运算，预分配内存可显著提升性能：

# 不推荐：动态扩展数组
result = []
for i in range(10000):
    result.append(i*i)
result = np.array(result)
# 推荐：预分配内存
result = np.empty(10000)
for i in range(10000):
    result[i] = i*i

5.2 向量化运算

# 非向量化实现（慢）
def slow_sum(arr):
    total = 0
    for x in arr:
        total += x
    return total
# 向量化实现（快）
def fast_sum(arr):
    return np.sum(arr)

六、实际应用案例

6.1 图像处理

# 生成随机灰度图像（100x100）
image = np.random.randint(0, 256, (100,100), dtype=np.uint8)
# 添加高斯噪声
noise = np.random.normal(0, 25, image.shape)
noisy_image = np.clip(image + noise, 0, 255).astype(np.uint8)

6.2 金融建模

# 模拟股票价格路径（几何布朗运动）
def simulate_stock_price(S0, mu, sigma, T, steps):
    dt = T/steps
    rand_nums = np.random.normal(0, 1, steps)
    t = np.linspace(0, T, steps)
    W = np.cumsum(rand_nums)*np.sqrt(dt)  # 维纳过程
    S = S0 * np.exp((mu-0.5*sigma**2)*t + sigma*W)
    return S
prices = simulate_stock_price(100, 0.05, 0.2, 1, 252)

通过系统掌握这些数组操作技术，开发者能够高效处理从简单数据整理到复杂科学计算的各类任务。建议结合具体应用场景，通过实践加深对广播机制、内存管理等高级特性的理解。

NumPy数组生成与运算全解析：从基础到进阶