2024/12/16~2024/12/20 面白そうな論文リスト
Byte Latent Transformer: Patches Scale Better Than Tokens
いい感じのバイトにエンコード・デコードすることで言語モデルの負担を減らして性能を上げる感じ?
MaxInfoRL: Boosting exploration in reinforcement learning through information gain maximization
最大情報量に基づく探索の強化学習
AlphaZero Neural Scaling and Zipf's Law: a Tale of Board Games and Power Laws
AlphaZeroにおけるZipf's Law
Policy Decorator: Model-Agnostic Online Refinement for Large Policy Model
Offline模倣学習からOnlineRLへ
Autoregressive Video Generation without Vector Quantization
離散化しないで自己回帰的に生成(VAEは使っている)
マルチトークンで一個分みたいな?
高速らしいけどなんで?
Latent Reward: LLM-Empowered Credit Assignment in Episodic Reinforcement Learning
LLMでRLを強化する系
Scaling 4D Representations
自動運転系
GaussianWorld: Gaussian World Model for Streaming 3D Occupancy Prediction
3DGSベースの世界モデル
GaussianAD: Gaussian-Centric End-to-End Autonomous Driving
3DGSベースの表現
WiseAD: Knowledge Augmented End-to-End Autonomous Driving with Vision-Language Model
VLMを用いたAD
Bench2Drive-R: Turning Real World Data into Reactive Closed-Loop Autonomous Driving Benchmark by Generative Model
拡散モデルを組み合わせてシミュレータ作り
GEM: A Generalizable Ego-Vision Multimodal World Model for Fine-Grained Ego-Motion, Object Dynamics, and Scene Composition Control
OpenEMMA: Open-Source Multimodal Model for End-to-End Autonomous Driving
VLM-AD: End-to-End Autonomous Driving through Vision-Language Model Supervision
DriveGPT: Scaling Autoregressive Behavior Models for Driving