Fine tuing

Unsloth、Axolotl、Llama Factory、Transformers/PEFT などの Fine-tuning ツールの比較と特徴が議論された

scale ai

Unslothの覚書き

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training