Non-exhaustive list of sources used in this course and evals tools that can help you.
Bronnen
We relied on several sources to write this series, including:
- AI Engineering: Building Applications with Foundation Models, Chip Huyen
- Het risico van QA voor LLM-applicaties verlagen - Michael Hablich, Chrome DevTools
- Using LLM-as-a-Judge For Evaluation: A Complete Guide - Hamel Husain
Evaluatietools
Voorbeelden van evaluatieoplossingen en -instrumenten zijn:
- AlignEval
- Sta op
- Braintrust
- Datadog
- DeepEval
- Gen AI-evaluatieservice en API van Vertex AI
- Inspectie-evaluaties
- RechterLM
- LangSmith
- Evaluatieharnas
- OpenEvals
This list is not exhaustive. If you are using other eval tools, share them with us .