#task-specific-evals
Task-Specific LLM Evals that Do & Don't Work
Evals for classification, summarization, translation, copyright regurgitation, and toxicity.