-
How to Work and Compound with AI
Context as infra, taste as config, verification for autonomy, scale via delegation, closing the loop.
-
2025 Year in Review
An eventful year of progress in health and career, while making time for travel and reflection.
-
Product Evals in Three Simple Steps
Label some data, align LLM-evaluators, and run the eval harness with each change.
-
Advice for New Principal Tech ICs (i.e., Notes to Myself)
Based on what I've learned from role models and mentors at Amazon
-
Training an LLM-RecSys Hybrid for Steerable Recs with Semantic IDs
An LLM that can converse in English & item IDs, and make recommendations w/o retrieval or tools.
-
Evaluating Long-Context Question & Answer Systems
Evaluation metrics, how to build eval datasets, eval methodology, and a review of several benchmarks.
-
AI Engineer 2025 - Improving RecSys & Search with LLM techniques
Recsys & search are converging with LLMs via semantic IDs, data augmentation, and unified foundation models.
-
Exceptional Leadership: Some Qualities, Behaviors, and Styles
What makes a good leader? What do good leaders do? And commando, soldier, and police leadership.
-
Building News Agents for Daily News Recaps with MCP, Q, and tmux
Learning to automate simple agentic workflows with Amazon Q CLI, Anthropic MCP, and tmux.
-
An LLM-as-Judge Won't Save The Product—Fixing Your Process Will
Applying the scientific method, building via eval-driven development, and monitoring AI output.
-
Frequently Asked Questions about My Writing Process
How I started, why I write, who I write for, how I write, and more.
-
NVIDIA GTC 2025 - Building LLM-Powered Applications
Chip Huyen and I share what we've learned, best practices, and insights at NVIDIA GTC 2025.
-
Improving Recommendation Systems & Search in the Age of LLMs
Model architectures, data generation, training paradigms, and unified frameworks inspired by LLMs.
-
Building AI Reading Club: Features & Behind the Scenes
Exploring what an AI-powered reading experience could look like.
-
Seemingly Paradoxical Rules of Writing
With regard to writing, there are many rules and also no rules at all.
-
How to Run a Weekly Paper Club (and Build a Learning Community)
Benefits of running a weekly paper club, how to start one, and how to read and facilitate papers.
-
My Minimal MacBook Pro Setup Guide
Setting up my new MacBook Pro from scratch
-
39 Lessons on Building ML Systems, Scaling, Execution, and More
ML systems, production & scaling, execution & collaboration, building for users, conference etiquette.
-
AlignEval: Building an App to Make Evals Easy, Fun, and Automated
Look at and label your data, build and evaluate your LLM-evaluator, and optimize it against your labels.
-
Weights & Biases LLM-Evaluator Hackathon - Hackathon Judge
Being a human judge at the Weights & Biases LLM-as-a-Judge Hackathon
-
Building the Same App Using Various Web Frameworks
FastAPI, FastHTML, Next.js, SvelteKit, and thoughts on how coding assistants influence builders' choices.
-
Evaluating the Effectiveness of LLM-Evaluators (aka LLM-as-Judge)
Use cases, techniques, alignment, finetuning, and critiques of LLM-evaluators.
-
How to Interview and Hire ML/AI Engineers
What to interview for, how to structure the phone screen, interview loop, and debrief, and a few tips.
-
AI Engineer 2024 Keynote - What We Learned from a Year of LLMs
Special double-feature closing keynote from the 6 authors of the hit O'Reilly article on Applied LLMs.
-
Netflix PRS 2024 - Applying LLMs to Recommendation Experiences
Challenges and lessons from deploying LLM experiences: evals, scalability, guardrails.
-
Prompting Fundamentals and How to Apply them Effectively
Structured input/output, prefilling, n-shots prompting, chain-of-thought, reducing hallucinations, etc.
-
What We've Learned From A Year of Building with LLMs
From the tactical nuts & bolts to the operational day-to-day to the long-term business strategy.
-
Building an AI Coach to Help Tame My Monkey Mind
Building an AI coach with speech-to-text, text-to-speech, an LLM, and a virtual number.
-
Task-Specific LLM Evals that Do & Don't Work
Evals for classification, summarization, translation, copyright regurgitation, and toxicity.