-
LLM 아키텍처의 최근 발전: KV 공유, mHC, 그리고 압축된 어텐션
Recent Developments in LLM Architectures: KV Sharing, mHC, and Compressed Attention
From Gemma 4 to DeepSeek V4, How New Open-Weight LLMs Are Reducing Long-Context Costs
-
NVIDIA Nemotron 3 Nano Omni 소개: 문서, 오디오, 비디오 에이전트를 위한 장문맥 멀티모달 지능
Introducing NVIDIA Nemotron 3 Nano Omni: Long-Context Multimodal Intelligence for Documents, Audio and Video Agents
-
긴 맥락 질의응답 시스템 평가
Evaluating Long-Context Question & Answer Systems
Evaluation metrics, how to build eval datasets, eval methodology, and a review of several benchmarks.