-
LLM 아키텍처를 이해하기 위한 내 워크플로우
My Workflow for Understanding LLM Architectures
A learning-oriented workflow for understanding new open-weight model releases
-
DeepSeek V3에서 V3.2로: 아키텍처, 희소 주의, 강화학습 업데이트
From DeepSeek V3 to V3.2: Architecture, Sparse Attention, and RL Updates
Understanding How DeepSeek's Flagship Open-Weight Models Evolved
-
주요 LLM 아키텍처 비교
The Big LLM Architecture Comparison
From DeepSeek-V3 to Kimi K2: A Look At Modern LLM Architecture Design