#attention-mechanisms

2 items total · Page 1 of 1

Sebastian Raschka · 2026-05-16

Recent Developments in LLM Architectures: KV Sharing, mHC, and Compressed Attention

From Gemma 4 to DeepSeek V4, How New Open-Weight LLMs Are Reducing Long-Context Costs

#long-context #open-weight-models #llm-architectures #attention-mechanisms #kv-sharing #compressed-attention

Sebastian Raschka · 2026-03-22

A Visual Guide to Attention Variants in Modern LLMs

From MHA and GQA to MLA, sparse attention, and hybrid architectures

#large-language-models #machine-learning #transformer-architecture #sparse-attention #attention-mechanisms #group-query-attention