-
LLM 아키텍처의 최근 발전: KV 공유, mHC, 그리고 압축된 어텐션
Recent Developments in LLM Architectures: KV Sharing, mHC, and Compressed Attention
From Gemma 4 to DeepSeek V4, How New Open-Weight LLMs Are Reducing Long-Context Costs
-
현대 LLM의 어텐션 변형 시각 가이드
A Visual Guide to Attention Variants in Modern LLMs
From MHA and GQA to MLA, sparse attention, and hybrid architectures