归档 - 善良的xwysyy

速查 4 笔记 2

2026

33 篇文章

Reconciling Contradictory Views on the Effectiveness of SFT in LLMs: An Interaction Perspective

#SFT #LLM #Explainability

强化学习算法梳理：从 PPO 到 GRPO 及之后

2024

1 篇文章

2023

3 篇文章

© 2026 xwysyy. All Rights Reserved.

Powered by Astro & Firefly