Crafting your own RAG system: Leveraging 30+ LLMs for enhanced performance
Speaker
About this talk
In this talk you'll learn how to set up a RAG (Retrieval-Augmented Generation) system against 30+ different Large Language Models using Java. We'll show you step-by-step how to ingest documents, choose the best text splitter strategies, find similar documents, answer questions, and create a chatbot. Then, we’ll see how to test and compare different AI models, both from open sources and private ones, and whether they are stored on your own computer or accessed online. You'll walk away knowing how to setup a well balanced RAG system using Java and the best performing and/or cheapest LLM.
More talks to watch
Let’s use IntelliJ as a game engine, just because we canAlexander Chatzizacharias
Devoxx Greece 2024 - Small steps are the fastest way forwardSander Hoogendoorn
A fun and absurd introduction to Vector DatabasesAlexander Chatzizacharias
Devoxx Greece 2024 - The lost art of software designSimon Brown
The Era of AAP: Ai Augmented Programming using only JavaStephan Janssen
From k9s to OpenTelemetry: A guide to observability for your apps in K8sMatthias Haeussle