AI News · #long-context

GeekNews · 10 days ago

Laguna S 2.1 released

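Poolside has released Laguna S 2.1, with stronger long-horizon task and reasoning capabilities. It is a 118B-parameter MoE that activates 8B parameters per token and supports a context of up to 1M tokens in both thinking and no-thinking modes. Training…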

#language-model #llm #long-context #moe #poolside

GeekNews · 2026-06-30

Memora: a scalable memory system for long-horizon tasks

Summary overview. Purpose: a memory framework that lets AI agents automatically extract the information they need from conversations and documents, then store and retrieve it over the long term. Core design…

#ai-agents #long-context #framework #memory-systems #information-extraction #scalable

GeekNews · 2026-06-24

Unlimited OCR: Baidu's one-shot long-document parsing model

An end-to-end OCR model built on DeepSeek OCR that replaces all attention in the decoder and transcribes documents of dozens of pages in a single forward pass. The core is Reference Sliding Window Attention (R-SWA)…

#deepseek #document-processing #long-context #attention-mechanisms #optical-character-recognition

GeekNews · 2026-06-09

Claude Fable 5/Mythos 5 released: Anthropic's fifth-generation frontier models

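Anthropic has launched its fifth-generation models, built for long-running asynchronous tasks that span days. Fable 5 is a version of the Mythos-class model made safe for general users, and Mythos 5 is the same model with some safeguards lifted. The Mythos class is a new model tier above the Opus class. The first model, …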

#anthropic #long-context #claude-fable-5 #claude-mythos-5 #frontier-model #async-tasks

Sebastian Raschka · 2026-05-16

Recent Developments in LLM Architectures: KV Sharing, mHC, and Compressed Attention

From Gemma 4 to DeepSeek V4, How New Open-Weight LLMs Are Reducing Long-Context Costs

#long-context #open-weight-models #llm-architectures #attention-mechanisms #kv-sharing #compressed-attention