반응형
llm 사후 학습
-
대규모 언어 모델(LLM) 사후학습(Post-Training) 전략 개요AI와 함께 2025. 3. 27. 11:11
AI 논문 리뷰https://arxiv.org/html/2502.21321 LLM Post-Training: A Deep Dive into Reasoning Large Language ModelsLLM Post-Training: A Deep Dive into Reasoning Large Language Models Komal Kumar∗, Tajamul Ashraf∗, Omkar Thawakar, Rao Muhammad Anwer, Hisham Cholakkal, Mubarak Shah, Ming-Hsuan Yang, Phillip H.S. Torr, Fahad Shahbaz Khan, Salman Khan ∗Equal contribuarxiv.org 총 5가지로 파트로 나눠서 AI로 정리한 글임 1. ..