LLM関連論文 - TOMIOKARIO

LLM関連論文

d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning

ハルシネーションペナルティ

The AI Consumer Index (ACE)

Why Language Models Hallucinate