IO 流为什么只能读取一次？从底层原理到 Web 实战 | 极客日志

Javajava

IO 流为什么只能读取一次？从底层原理到 Web 实战

IO 流通常只能读取一次，因为底层维护位置指针，读取后指针移动或数据被消耗。网络流和文件流尤其如此。解决方案包括使用支持 mark/reset 的内存流（如 ByteArrayInputStream），或在 Web 开发中通过包装请求流（如 CachedBodyHttpServletRequest）将数据缓存到内存中实现多次读取。需注意内存占用与 IO 性能的权衡，小请求体适合缓存，大请求体建议流式处理。

随缘发布于 2026/4/5更新于 2026/7/2046 浏览

IO 流为什么只能读取一次？从底层原理到 Web 实战

引言：一个让无数开发者困惑的问题

在 Web 开发中，你是否遇到过这样的场景：

@RestController
public class UserController {
    @PostMapping("/user")
    public String createUser(@RequestBody User user) {
        // 这里收到的 user 为 null 或数据不完整！
        return "success";
    }
}

明明在过滤器中已经读取过请求体了：

@WebFilter("/*")
public class LogFilter implements Filter {
    public void doFilter(ServletRequest request, ServletResponse response, FilterChain chain) throws IOException, ServletException {
        InputStream is = request.getInputStream();
        String body = IOUtils.toString(is);
        // 读取了请求体
        // ...
        chain.doFilter(request, response);
    }
}

问题：为什么过滤器读取后，Controller 就收不到数据了？

答案是：IO 流通常只能被读取一次。本文将深入剖析这一现象背后的原理，并提供解决方案。

1. IO 流的本质：顺序读取的'磁带'

1.1 位置指针（Position Pointer）

所有基于流的读取操作都维护着一个位置指针：

public abstract class  {
    
    
         IOException;
}

相关免费在线工具

Keycode 信息
查找任何按下的键的javascript键代码、代码、位置和修饰符。在线工具，Keycode 信息在线工具，online
Escape 与 Native 编解码
JavaScript 字符串转义/反转义；Java 风格 \uXXXX（Native2Ascii）编码与解码。在线工具，Escape 与 Native 编解码在线工具，online
JavaScript / HTML 格式化
使用 Prettier 在浏览器内格式化 JavaScript 或 HTML 片段。在线工具，JavaScript / HTML 格式化在线工具，online
JavaScript 压缩与混淆
Terser 压缩、变量名混淆，或 javascript-obfuscator 高强度混淆（体积会增大）。在线工具，JavaScript 压缩与混淆在线工具，online
Base64 字符串编码/解码
将字符串编码和解码为其 Base64 格式表示形式即可。在线工具，Base64 字符串编码/解码在线工具，online
Base64 文件转换器
将字符串、文件或图像转换为其 Base64 表示形式。在线工具，Base64 文件转换器在线工具，online

public class StreamReadSimulation {
    public static void main(String[] args) throws IOException {
        byte[] data = {65, 66, 67, 68}; // ABCD
        // 模拟 InputStream
        ByteArrayInputStream stream = new ByteArrayInputStream(data);
        
        System.out.println("第 1 次读取：" + stream.read()); // 65 (A)
        System.out.println("第 2 次读取：" + stream.read()); // 66 (B)
        System.out.println("第 3 次读取：" + stream.read()); // 67 (C)
        System.out.println("第 4 次读取：" + stream.read()); // 68 (D)
        System.out.println("第 5 次读取：" + stream.read()); // -1 (EOF)
        // 指针已到末尾，无法再读取
        System.out.println("第 6 次读取：" + stream.read()); // -1
    }
}

第 1 次读取：65
第 2 次读取：66
第 3 次读取：67
第 4 次读取：68
第 5 次读取：-1
第 6 次读取：-1

数据源类型	为什么只能读一次	类比
网络流	数据是实时传输的，TCP 缓冲区数据读取后即丢弃	直播流，无法回放
文件流	底层是操作系统文件句柄，顺序读取效率最高	磁带播放器
控制台流	用户输入是一次性的	一次性对话

// InputStream.java (JDK 源码片段)
public abstract class InputStream implements Closeable {
    // 抽象方法，由子类实现真正的读取
    public abstract int read() throws IOException;

    // 批量读取，本质是循环调用 read()
    public int read(byte b[], int off, int len) throws IOException {
        if (b == null) {
            throw new NullPointerException();
        } else if (off < 0 || len < 0 || len > b.length - off) {
            throw new IndexOutOfBoundsException();
        } else if (len == 0) {
            return 0;
        }
        int c = read(); // 调用 read() 读取第一个字节
        if (c == -1) {
            return -1;
        }
        b[off] = (byte) c;
        int i = 1;
        try {
            for (; i < len; i++) {
                c = read();
                if (c == -1) {
                    break;
                }
                b[off + i] = (byte) c;
            }
        } catch (IOException ee) {}
        return i;
    }

    // 跳过 n 个字节，指针移动但不读取
    public long skip(long n) throws IOException {
        long remaining = n;
        while (remaining > 0) {
            if (read() == -1) {
                break;
            }
            remaining--;
        }
        return n - remaining;
    }
}

// FileInputStream.java (简化版)
public class FileInputStream extends InputStream {
    // 文件描述符
    private final FileDescriptor fd;
    // 本地方法，真正读取一个字节
    private native int read0() throws IOException;

    @Override
    public int read() throws IOException {
        // 调用本地方法，操作系统维护文件指针
        return read0();
    }
}

// SocketInputStream.java (简化版)
class SocketInputStream extends FileInputStream {
    @Override
    public int read() throws IOException {
        // 网络数据从 TCP 缓冲区读取
        // 读取后数据从缓冲区移除
        return super.read();
    }
}

public class MarkResetExample {
    public static void main(String[] args) throws IOException {
        byte[] data = "Hello World".getBytes();
        ByteArrayInputStream bais = new ByteArrayInputStream(data);
        System.out.println("是否支持 mark/reset: " + bais.markSupported()); // true
        
        // 标记当前位置
        bais.mark(0);
        // 第一次读取
        byte[] first = new byte[5];
        bais.read(first);
        System.out.println("第一次读取：" + new String(first)); // Hello
        
        // 重置到标记位置
        bais.reset();
        // 第二次读取（相同内容）
        byte[] second = new byte[5];
        bais.read(second);
        System.out.println("第二次读取：" + new String(second)); // Hello
    }
}

是否支持 mark/reset: true
第一次读取：Hello
第二次读取：Hello

// ByteArrayInputStream.java (简化版)
public class ByteArrayInputStream extends InputStream {
    protected byte buf[];      // 数据缓冲区
    protected int pos;         // 当前读取位置
    protected int mark;        // 标记位置

    @Override
    public void mark(int readAheadLimit) {
        mark = pos; // 保存当前指针位置
    }

    @Override
    public void reset() {
        pos = mark; // 恢复指针到标记位置
    }

    @Override
    public boolean markSupported() {
        return true;
    }
}

流类型	是否支持 mark	原因
ByteArrayInputStream	✅ 支持	数据在内存中，可重复读取
BufferedInputStream	✅ 支持	内部有缓冲区
FileInputStream	❌ 不支持	依赖操作系统文件指针
SocketInputStream	❌ 不支持	网络数据实时传输
System.in	❌ 不支持	控制台输入一次性的

@WebFilter("/*")
public class LoggingFilter implements Filter {
    @Override
    public void doFilter(ServletRequest request, ServletResponse response, FilterChain chain) throws IOException, ServletException {
        HttpServletRequest req = (HttpServletRequest) request;
        // 读取请求体用于日志
        String body = readBody(req.getInputStream());
        System.out.println("请求体：" + body);
        // 传递给 Controller
        chain.doFilter(request, response);
        // ❌ Controller 会收不到数据
    }

    private String readBody(InputStream is) throws IOException {
        ByteArrayOutputStream result = new ByteArrayOutputStream();
        byte[] buffer = new byte[1024];
        int length;
        while ((length = is.read(buffer)) != -1) {
            result.write(buffer, 0, length);
        }
        return result.toString();
    }
}

public class CachedBodyHttpServletRequest extends HttpServletRequestWrapper {
    private final byte[] cachedBody; // 缓存请求体

    public CachedBodyHttpServletRequest(HttpServletRequest request) throws IOException {
        super(request);
        // 读取并缓存请求体
        this.cachedBody = readBody(request.getInputStream());
    }

    private byte[] readBody(InputStream is) throws IOException {
        ByteArrayOutputStream baos = new ByteArrayOutputStream();
        byte[] buffer = new byte[1024];
        int read;
        while ((read = is.read(buffer)) != -1) {
            baos.write(buffer, 0, read);
        }
        return baos.toByteArray();
    }

    @Override
    public ServletInputStream getInputStream() throws IOException {
        // 每次调用都返回新的流，基于缓存的数据
        return new CachedBodyServletInputStream(this.cachedBody);
    }

    @Override
    public BufferedReader getReader() throws IOException {
        return new BufferedReader(new InputStreamReader(getInputStream()));
    }
}

class CachedBodyServletInputStream extends ServletInputStream {
    private final ByteArrayInputStream inputStream;

    public CachedBodyServletInputStream(byte[] cachedBody) {
        this.inputStream = new ByteArrayInputStream(cachedBody);
    }

    @Override
    public int read() throws IOException {
        return inputStream.read();
    }

    @Override
    public boolean isFinished() {
        return inputStream.available() == 0;
    }

    @Override
    public boolean isReady() {
        return true;
    }

    @Override
    public void setReadListener(ReadListener listener) {
        // 简化实现
    }
}

@WebFilter("/*")
public class CachingFilter implements Filter {
    @Override
    public void doFilter(ServletRequest request, ServletResponse response, FilterChain chain) throws IOException, ServletException {
        HttpServletRequest req = (HttpServletRequest) request;
        // 包装请求
        CachedBodyHttpServletRequest cachedRequest = new CachedBodyHttpServletRequest(req);
        // 可以多次读取请求体
        System.out.println("过滤器第 1 次读取：" + IOUtils.toString(cachedRequest.getInputStream()));
        // 再次读取（有效！）
        System.out.println("过滤器第 2 次读取：" + IOUtils.toString(cachedRequest.getInputStream()));
        // 传递给 Controller
        chain.doFilter(cachedRequest, response);
        // ✅ Controller 能正常接收数据
    }
}

@WebFilter
public class SpringCachingFilter extends OncePerRequestFilter {
    @Override
    protected void doFilterInternal(HttpServletRequest request, HttpServletResponse response, FilterChain chain) throws IOException, ServletException {
        // Spring 内置的包装类
        ContentCachingRequestWrapper wrapper = new ContentCachingRequestWrapper(request);
        chain.doFilter(wrapper, response);
        // 请求处理后读取缓存的内容（此时才能读到）
        byte[] body = wrapper.getContentAsByteArray();
        // 记录日志等
    }
}

public class RepeatableInputStream extends InputStream {
    private final byte[] data;
    private int position;
    private int markPosition;

    public RepeatableInputStream(byte[] data) {
        this.data = data;
        this.position = 0;
    }

    public RepeatableInputStream(InputStream is) throws IOException {
        ByteArrayOutputStream baos = new ByteArrayOutputStream();
        byte[] buffer = new byte[8192];
        int len;
        while ((len = is.read(buffer)) != -1) {
            baos.write(buffer, 0, len);
        }
        this.data = baos.toByteArray();
        this.position = 0;
    }

    @Override
    public int read() throws IOException {
        if (position >= data.length) {
            return -1;
        }
        return data[position++] & 0xFF;
    }

    @Override
    public int read(byte[] b, int off, int len) throws IOException {
        if (b == null) {
            throw new NullPointerException();
        }
        if (off < 0 || len < 0 || len > b.length - off) {
            throw new IndexOutOfBoundsException();
        }
        if (position >= data.length) {
            return -1;
        }
        int available = data.length - position;
        int toRead = Math.min(len, available);
        System.arraycopy(data, position, b, off, toRead);
        position += toRead;
        return toRead;
    }

    @Override
    public long skip(long n) throws IOException {
        int available = data.length - position;
        int toSkip = (int) Math.min(n, available);
        position += toSkip;
        return toSkip;
    }

    @Override
    public int available() throws IOException {
        return data.length - position;
    }

    @Override
    public boolean markSupported() {
        return true;
    }

    @Override
    public void mark(int readlimit) {
        markPosition = position; // 标记当前位置
    }

    @Override
    public void reset() throws IOException {
        position = markPosition; // 重置到标记位置
    }
}

public class RepeatableStreamDemo {
    public static void main(String[] args) throws IOException {
        // 原始流（只能读一次）
        InputStream original = new FileInputStream("test.txt");
        // 包装成可重复读的流
        RepeatableInputStream repeatable = new RepeatableInputStream(original);
        // 可以多次读取
        System.out.println("第 1 次读取：" + IOUtils.toString(repeatable, "UTF-8"));
        repeatable.reset(); // 重置
        System.out.println("第 2 次读取：" + IOUtils.toString(repeatable, "UTF-8"));
    }
}

方案	优点	缺点	适用场景
直接读取	内存占用小	只能读一次	大型文件流式处理
缓存到内存	可多次读取	内存占用大	小请求体（<1MB）
缓存到磁盘	可多次读取	IO 开销大	超大文件需重复处理

@Component
public class RequestBodyCacheAdvice {
    // 配置：只缓存小请求体
    private static final int MAX_CACHE_SIZE = 1024 * 1024; // 1MB

    public HttpServletRequest wrapIfNeeded(HttpServletRequest request) {
        if (isSmallRequest(request)) {
            return new CachedBodyHttpServletRequest(request);
        }
        return request; // 大请求不缓存，避免内存溢出
    }

    private boolean isSmallRequest(HttpServletRequest request) {
        String contentLength = request.getHeader("Content-Length");
        if (contentLength != null) {
            try {
                return Integer.parseInt(contentLength) <= MAX_CACHE_SIZE;
            } catch (NumberFormatException e) {
                return false;
            }
        }
        return false; // 未知大小，不缓存
    }
}

public class PerformanceTest {
    public static void main(String[] args) throws IOException {
        byte[] data = new byte[1024 * 1024]; // 1MB 数据
        new Random().nextBytes(data);

        // 1. 直接读取
        ByteArrayInputStream bais = new ByteArrayInputStream(data);
        long start = System.nanoTime();
        readFully(bais);
        long directTime = System.nanoTime() - start;

        // 2. 缓存后读取
        ByteArrayInputStream bais2 = new ByteArrayInputStream(data);
        byte[] cached = readFully(bais2);
        start = System.nanoTime();
        for (int i = 0; i < 10; i++) {
            ByteArrayInputStream cachedStream = new ByteArrayInputStream(cached);
            readFully(cachedStream);
        }
        long cachedTime = System.nanoTime() - start;

        System.out.println("直接读取：" + directTime / 1_000_000 + "ms");
        System.out.println("缓存后读取 10 次：" + cachedTime / 1_000_000 + "ms");
    }

    private static byte[] readFully(InputStream is) throws IOException {
        ByteArrayOutputStream baos = new ByteArrayOutputStream();
        byte[] buffer = new byte[8192];
        int len;
        while ((len = is.read(buffer)) != -1) {
            baos.write(buffer, 0, len);
        }
        return baos.toByteArray();
    }
}

概念	类比	原因
位置指针	磁带机的磁头	操作系统和网络协议栈的设计
顺序读取	一次性吸管	数据源的实时性要求
mark/reset	书签	仅适用于内存数据源
包装缓存	录像回放	通过内存存储实现多次读取

IO 流为什么只能读取一次？从底层原理到 Web 实战

IO 流为什么只能读取一次？从底层原理到 Web 实战

引言：一个让无数开发者困惑的问题

1. IO 流的本质：顺序读取的'磁带'

1.1 位置指针（Position Pointer）

更多推荐文章

相关免费在线工具

1.2 读取过程模拟

1.3 为什么设计成只能读一次？

2. 深入源码：InputStream 的 read 机制

2.1 核心方法分析

2.2 FileInputStream 的实现

2.3 SocketInputStream 的实现

3. 例外情况：支持重置的流

3.1 ByteArrayInputStream 支持重置

3.2 mark/reset 原理

3.3 常见流的支持情况

4. 实战：Web 请求体的多次读取

4.1 问题重现

4.2 解决方案：包装请求

4.3 过滤器中使用包装类

4.4 Spring 框架的解决方案

5. 高级技巧：包装流的多种实现

5.1 实现可重复读的 InputStream

5.2 使用示例

6. 性能考虑与最佳实践

6.1 内存 vs IO 的权衡

6.2 Web 应用中的最佳实践

6.3 性能对比

总结：IO 流读取的本质

更多推荐文章

相关免费在线工具

IO 流为什么只能读取一次？从底层原理到 Web 实战

IO 流为什么只能读取一次？从底层原理到 Web 实战

引言：一个让无数开发者困惑的问题

1. IO 流的本质：顺序读取的'磁带'

1.1 位置指针（Position Pointer）

微信扫一扫，关注极客日志

更多推荐文章

相关免费在线工具

1.2 读取过程模拟

1.3 为什么设计成只能读一次？

2. 深入源码：InputStream 的 read 机制

2.1 核心方法分析

2.2 FileInputStream 的实现

2.3 SocketInputStream 的实现

3. 例外情况：支持重置的流

3.1 ByteArrayInputStream 支持重置

3.2 mark/reset 原理

3.3 常见流的支持情况

4. 实战：Web 请求体的多次读取

4.1 问题重现

4.2 解决方案：包装请求

4.3 过滤器中使用包装类

4.4 Spring 框架的解决方案

5. 高级技巧：包装流的多种实现

5.1 实现可重复读的 InputStream

5.2 使用示例

6. 性能考虑与最佳实践

6.1 内存 vs IO 的权衡

6.2 Web 应用中的最佳实践

6.3 性能对比

总结：IO 流读取的本质

微信扫一扫，关注极客日志

更多推荐文章

相关免费在线工具