#mixture-of-experts
총 2건 · 1/1 페이지
-
Mellum2 소개: JetBrains의 12B Mixture-of-Experts 모델
Introducing Mellum2: A 12B Mixture-of-Experts Model by JetBrains
-
딥씨크 V4 - 거의 최고 수준의 성능, 저렴한 가격
DeepSeek V4 - almost on the frontier, a fraction of the price
<p>Chinese AI lab DeepSeek's last model release was V3.2 (and V3.2 Speciale) <a href="https://simonwillison.net/2025/Dec/1/deepseek-v32/">last December</a>. They just dropped the f…