#benchmark-performance



GeekNews · 2026-06-02 · title translated

MiniMax-M3 debuts, outperforming GPT-5.5 and Gemini 3.1 Pro on major benchmarks at just 5-10% of the cost

Chinese AI startup MiniMax has launched 'M3', an open-weight multimodal large language model that outperforms GPT-5.5 and Gemini 3.1 Pro at a strikingly low 5-10% of the cost of existing US commercial models. …

#minimax-m3 #benchmark-performance #cost-effective #multimodal #open-weight #ai-startup

Anthropic Engineering · 2026-03-06 · title translated

Eval awareness in Claude Opus 4.6’s BrowseComp performance (March 6, 2026)

#language-model #claude-opus #benchmark-performance #eval-awareness #browse-comp #web-browsing