Tag
evaluation
2 posts
2025-03-06
RAG Evaluation: Groundedness, Relevance, and Regression Tests
A practical engineering deep dive on rag evaluation with architecture patterns, implementation guidance, and production guardrails.
2023-04-06
Evaluating LLM Apps: Quality, Cost, Latency, and Hallucinations
A practical engineering deep dive on evaluating llm apps with architecture patterns, implementation guidance, and production guardrails.