Beyond the Prompt: Evaluating, Testing, and Securing LLM Applications
Speaker
About this talk
This talk discusses evaluating and securing LLM applications by measuring changes in prompts or RAG pipelines. It highlights evaluation frameworks like Vertex AI Evaluation, DeepEval, and Promptfoo, and introduces security measures using LLM Guard to ensure resilience against prompt injections and harmful responses, emphasizing the need for robust input-output guardrails.
More talks to watch
Kotlin - the new and noteworthyAnton Arhipov
Dockerfiles, Jib ..., what's the best way to run your Java code in Containers?Matthias Haeussle
How to survive as a developer in the exponential age of AI - KeynoteSander Hoogendoorn
Your frontend is ☠️ ⚠️ Let's measure its impact with CO2 jsKo Turk
Let’s use IntelliJ as a game engine, just because we canAlexander Chatzizacharias
Onion, Hexagonal, Clean or Fractal Architecture? All of them, and more!Urs Enzler