深度学习入门实战：从基础概念到手写数字识别

深度学习入门实战：从基础概念到手写数字识别 | 极客日志

import tensorflow as tf
from tensorflow.keras import datasets, layers, models
import matplotlib.pyplot as plt

# 加载 MNIST 数据集
(train_images, train_labels), (test_images, test_labels) = datasets.mnist.load_data()

print(f"训练集形状：{train_images.shape}")
print(f"测试集形状：{test_images.shape}")

plt.figure(figsize=(10, 4))
for i in range(10):
    plt.subplot(2, 5, i+1)
    plt.imshow(train_images[i], cmap='gray')
    plt.title(f"Label: {train_labels[i]}")
    plt.axis('off')
plt.show()

# 归一化
train_images = train_images.astype('float32') / 255.0
test_images = test_images.astype('float32') / 255.0

# 重塑形状 (Batch_Size, Height, Width, Channels)
train_images = train_images.reshape(-1, 28, 28, 1)
test_images = test_images.reshape(-1, 28, 28, 1)

model = models.Sequential([
    layers.Conv2D(32, (3, 3), activation='relu', input_shape=(28, 28, 1)),
    layers.MaxPooling2D((2, 2)),
    layers.Conv2D(64, (3, 3), activation='relu'),
    layers.MaxPooling2D((2, 2)),
    layers.Flatten(),
    layers.Dense(128, activation='relu'),
    layers.Dropout(0.5),  # 防止过拟合
    layers.Dense(10, activation='softmax')
])

model.summary()

model.compile(optimizer='adam',
              loss='sparse_categorical_crossentropy',
              metrics=['accuracy'])

# 训练模型
history = model.fit(train_images, train_labels,
                    epochs=5,
                    batch_size=64,
                    validation_split=0.1,
                    verbose=1)

acc = history.history['accuracy']
val_acc = history.history['val_accuracy']
loss = history.history['loss']
val_loss = history.history['val_loss']

epochs_range = range(len(acc))

plt.figure(figsize=(12, 4))
plt.subplot(1, 2, 1)
plt.plot(epochs_range, acc, label='Training Accuracy')
plt.plot(epochs_range, val_acc, label='Validation Accuracy')
plt.legend(loc='lower right')
plt.title('Accuracy')

plt.subplot(1, 2, 2)
plt.plot(epochs_range, loss, label='Training Loss')
plt.plot(epochs_range, val_loss, label='Validation Loss')
plt.legend(loc='upper right')
plt.title('Loss')
plt.show()

# 保存模型
model.save('mnist_cnn_model.h5')

# 加载模型
loaded_model = models.load_model('mnist_cnn_model.h5')

# 预测示例
predictions = loaded_model.predict(test_images)
predicted_label = predictions[0].argmax()
print(f"预测结果：{predicted_label}, 真实标签：{test_labels[0]}")

深度学习入门实战：从基础概念到手写数字识别

深度学习入门实战：从基础概念到手写数字识别

一、深度学习的基本概念

1.1 核心术语解析

1.2 主流深度学习框架对比

1.3 经典模型架构

二、经典入门 Demo 实战：手写数字识别

2.1 任务背景

2.2 环境准备

2.3 数据预处理

1. 加载数据

2. 数据可视化

3. 归一化与重塑

2.4 构建神经网络模型

2.5 编译与训练

2.6 模型评估与可视化

2.7 模型保存与预测

三、常见问题与优化策略

3.1 过拟合（Overfitting）

3.2 欠拟合（Underfitting）

四、总结

更多推荐文章

相关免费在线工具

深度学习入门实战：从基础概念到手写数字识别

深度学习入门实战：从基础概念到手写数字识别

一、深度学习的基本概念

1.1 核心术语解析

1.2 主流深度学习框架对比

1.3 经典模型架构

二、经典入门 Demo 实战：手写数字识别

2.1 任务背景

2.2 环境准备

2.3 数据预处理

1. 加载数据

2. 数据可视化

3. 归一化与重塑

2.4 构建神经网络模型

2.5 编译与训练

2.6 模型评估与可视化

2.7 模型保存与预测

三、常见问题与优化策略

3.1 过拟合（Overfitting）

3.2 欠拟合（Underfitting）

四、总结

微信扫一扫，关注极客日志

更多推荐文章

相关免费在线工具