#long-context

Sebastian Raschka · 2026-05-16 · Translated

Recent Developments in LLM Architectures: KV Sharing, mHC, and Compressed Attention

From Gemma 4 to DeepSeek V4, How New Open-Weight LLMs Are Reducing Long-Context Costs

#long-context #open-weight-models #llm-architectures #attention-mechanisms #kv-sharing #compressed-attention