ICLR 2024 杰出论文公布：大模型成最大赢家

昨天，国际表征学习大会（International Conference on Learning Representations，ICLR）公布了 ICLR 2024 杰出论文。

ICLR 2024 杰出论文相关图片

其中，在 5 篇杰出论文中，有 4 篇论文涉及大模型。另外，也有 11 篇论文获得荣誉提名。祝贺各位获奖者！

ICLR 是机器学习领域的一个学术会议，每年一次，通常在每年四月底或五月初举行。会议包括特邀演讲以及经评审论文的口头和海报展示。

ICLR 由 Yann LeCun（杨立昆）和 Yoshua Bengio 两位图灵奖得主创立，被学术研究者们广泛认可，被认为是'深度学习的顶级会议'。自 2013 年举办首届起，该会议一直采用开放式同行评审。

目前，ICLR 2024 正在奥地利维也纳举行（5 月 7 日 - 11 日）。

杰出论文奖

论文 1：Generalization in diffusion models arises from geometry-adaptive harmonic representations

作者：Zahra Kadkhodaie, Florentin Guth, Eero P Simoncelli, Stéphane Mallat
所属机构：纽约大学、Simons Foundation
论文链接：https://openreview.net/forum?id=ANvmVS2Yr0
获奖理由：这篇论文对图像扩散模型的泛化和记忆方面进行了重要的深入分析。作者根据经验研究了图像生成模型何时从记忆输入转换到泛化机制，并通过几何自适应谐波表征与谐波分析的思想建立联系，进一步从建筑归纳偏差的角度解释了这一现象。论文涵盖了我们对视觉生成模型理解中的一个关键缺失部分，很可能会对该领域未来的重要理论研究有所启发。

ICLR 2024 杰出论文相关图片

论文 2：Learning Interactive Real-World Simulators

作者：Sherry Yang, Yilun Du, Seyed Kamyar Seyed Ghasemipour, Jonathan Tompson, Leslie Pack Kaelbling, Dale Schuurmans, Pieter Abbeel
所属机构：UC 伯克利、Google DeepMind、MIT
论文链接：https://openreview.net/forum?id=sFyTZEqmUY
获奖理由：汇集多个来源的数据来训练机器人基础模型是一个长期的宏伟目标。由于不同的机器人具有不同的感知 - 运动界面，这阻碍了大规模数据集的训练，因此带来了巨大的挑战。这项名为'UniSim'的工作是朝着这个方向迈出的重要一步，也是一项工程壮举，它使用基于视觉感知和控制文字描述的统一界面来聚合数据，并利用视觉和语言领域的最新发展，从数据中训练机器人模拟器。

ICLR 2024 杰出论文相关图片

ICLR 2024 杰出论文公布：大模型成最大赢家

杰出论文奖

论文 1：Generalization in diffusion models arises from geometry-adaptive harmonic representations

论文 2：Learning Interactive Real-World Simulators

更多推荐文章

相关免费在线工具

论文 3：Never Train from Scratch: Fair Comparison of Long-Sequence Models Requires Data-Driven Priors

论文 4：Protein Discovery with Discrete Walk-Jump Sampling

论文 5：Vision Transformers Need Registers

杰出论文奖荣誉提名

论文 1：Amortizing intractable inference in large language models

论文 2：Approximating Nash Equilibria in Normal-Form Games via Stochastic Optimization

论文 3：Beyond Weisfeiler-Lehman: A Quantitative Framework for GNN Expressiveness

论文 4：Flow Matching on General Geometries

论文 5：Is ImageNet worth 1 video? Learning strong image encoders from 1 long unlabelled video

论文 6：Meta Continual Learning Revisited: Implicitly Enhancing Online Hessian Approximation via Variance Reduction

论文 7：Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs

论文 8：Proving Test Set Contamination in Black-Box Language Models

论文 9：Robust agents learn causal world models

论文 10：The mechanistic basis of data dependence and abrupt learning in an in-context classification task

论文 11：Towards a statistical theory of data selection under weak supervision

更多推荐文章

相关免费在线工具

ICLR 2024 杰出论文公布：大模型成最大赢家

杰出论文奖

论文 1：Generalization in diffusion models arises from geometry-adaptive harmonic representations

论文 2：Learning Interactive Real-World Simulators

微信扫一扫，关注极客日志

更多推荐文章

相关免费在线工具

论文 3：Never Train from Scratch: Fair Comparison of Long-Sequence Models Requires Data-Driven Priors

论文 4：Protein Discovery with Discrete Walk-Jump Sampling

论文 5：Vision Transformers Need Registers

杰出论文奖荣誉提名

论文 1：Amortizing intractable inference in large language models

论文 2：Approximating Nash Equilibria in Normal-Form Games via Stochastic Optimization

论文 3：Beyond Weisfeiler-Lehman: A Quantitative Framework for GNN Expressiveness

论文 4：Flow Matching on General Geometries

论文 5：Is ImageNet worth 1 video? Learning strong image encoders from 1 long unlabelled video

论文 6：Meta Continual Learning Revisited: Implicitly Enhancing Online Hessian Approximation via Variance Reduction

论文 7：Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs

论文 8：Proving Test Set Contamination in Black-Box Language Models

论文 9：Robust agents learn causal world models

论文 10：The mechanistic basis of data dependence and abrupt learning in an in-context classification task

论文 11：Towards a statistical theory of data selection under weak supervision

微信扫一扫，关注极客日志

更多推荐文章

相关免费在线工具