ICLR 2024 杰出论文：大模型相关工作占了大头

昨天，国际表征学习大会（International Conference on Learning Representations，ICLR）公布了 ICLR 2024 杰出论文。

ICLR 2024 杰出论文相关图片

这次一共评出 5 篇杰出论文，其中 4 篇都和大模型有关；另外还有 11 篇论文拿到荣誉提名。结果并不意外，但这个比例还是挺显眼的：ICLR 这几年已经很难绕开大模型了。

ICLR 是机器学习领域的重要会议，每年举办一次，通常在四月底或五月初举行。会议内容包括特邀演讲，以及经评审论文的口头和海报展示。

ICLR 由 Yann LeCun（杨立昆）和 Yoshua Bengio 两位图灵奖得主创立，在深度学习圈子里一直有很高的存在感。自 2013 年首届会议起，它就采用开放式同行评审。当前的 ICLR 2024 正在奥地利维也纳举行，时间是 5 月 7 日到 11 日。

杰出论文奖

论文 1：Generalization in diffusion models arises from geometry-adaptive harmonic representations

作者：Zahra Kadkhodaie, Florentin Guth, Eero P Simoncelli, Stéphane Mallat
所属机构：纽约大学、Simons Foundation
论文链接：https://openreview.net/forum?id=ANvmVS2Yr0
获奖理由：这篇论文盯住的是扩散模型里一个老问题：什么时候模型在'记住'输入，什么时候真的开始泛化。作者从几何自适应谐波表征和谐波分析的角度，把这个现象讲清楚了一些，也补上了我们理解视觉生成模型时缺的一块。它不是那种特别炫的工作，但理论上的洞察够扎实。

ICLR 2024 杰出论文相关图片

论文 2：Learning Interactive Real-World Simulators

作者：Sherry Yang, Yilun Du, Seyed Kamyar Seyed Ghasemipour, Jonathan Tompson, Leslie Pack Kaelbling, Dale Schuurmans, Pieter Abbeel
所属机构：UC 伯克利、Google DeepMind、MIT
论文链接：https://openreview.net/forum?id=sFyTZEqmUY
获奖理由：这篇工作讨论的是怎么把来自不同机器人的数据汇到一起，训练机器人基础模型。难点不在'数据很多'这件事本身，而在不同机器人有不同的感知和控制接口，数据天然碎片化。UniSim 的做法是用一个统一界面把这些数据接起来，再借助视觉和语言领域的新进展去训练模拟器。更像一项工程能力很强的推进，而不是单点技巧。

ICLR 2024 杰出论文相关图片

ICLR 2024 杰出论文：大模型相关工作占了大头

杰出论文奖

论文 1：Generalization in diffusion models arises from geometry-adaptive harmonic representations

论文 2：Learning Interactive Real-World Simulators

更多推荐文章

相关免费在线工具

论文 3：Never Train from Scratch: Fair Comparison of Long-Sequence Models Requires Data-Driven Priors

论文 4：Protein Discovery with Discrete Walk-Jump Sampling

论文 5：Vision Transformers Need Registers

杰出论文奖荣誉提名

论文 1：Amortizing intractable inference in large language models

论文 2：Approximating Nash Equilibria in Normal-Form Games via Stochastic Optimization

论文 3：Beyond Weisfeiler-Lehman: A Quantitative Framework for GNN Expressiveness

论文 4：Flow Matching on General Geometries

论文 5：Is ImageNet worth 1 video? Learning strong image encoders from 1 long unlabelled video

论文 6：Meta Continual Learning Revisited: Implicitly Enhancing Online Hessian Approximation via Variance Reduction

论文 7：Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs

论文 8：Proving Test Set Contamination in Black-Box Language Models

论文 9：Robust agents learn causal world models

论文 10：The mechanistic basis of data dependence and abrupt learning in an in-context classification task

论文 11：Towards a statistical theory of data selection under weak supervision

更多推荐文章

相关免费在线工具

ICLR 2024 杰出论文：大模型相关工作占了大头

杰出论文奖

论文 1：Generalization in diffusion models arises from geometry-adaptive harmonic representations

论文 2：Learning Interactive Real-World Simulators

微信扫一扫，关注极客日志

更多推荐文章

相关免费在线工具

论文 3：Never Train from Scratch: Fair Comparison of Long-Sequence Models Requires Data-Driven Priors

论文 4：Protein Discovery with Discrete Walk-Jump Sampling

论文 5：Vision Transformers Need Registers

杰出论文奖荣誉提名

论文 1：Amortizing intractable inference in large language models

论文 2：Approximating Nash Equilibria in Normal-Form Games via Stochastic Optimization

论文 3：Beyond Weisfeiler-Lehman: A Quantitative Framework for GNN Expressiveness

论文 4：Flow Matching on General Geometries

论文 5：Is ImageNet worth 1 video? Learning strong image encoders from 1 long unlabelled video

论文 6：Meta Continual Learning Revisited: Implicitly Enhancing Online Hessian Approximation via Variance Reduction

论文 7：Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs

论文 8：Proving Test Set Contamination in Black-Box Language Models

论文 9：Robust agents learn causal world models

论文 10：The mechanistic basis of data dependence and abrupt learning in an in-context classification task

论文 11：Towards a statistical theory of data selection under weak supervision

微信扫一扫，关注极客日志

更多推荐文章

相关免费在线工具