Crafting your own RAG system: Leveraging 30+ LLMs for enhanced performance

Devoxx 2024Jun 20240 views

Speaker

Stephan JanssenDevoxx founder | AI-augmented programming and Java innovation expert

About this talk

In this talk you'll learn how to set up a RAG (Retrieval-Augmented Generation) system against 30+ different Large Language Models using Java. We'll show you step-by-step how to ingest documents, choose the best text splitter strategies, find similar documents, answer questions, and create a chatbot. Then, we’ll see how to test and compare different AI models, both from open sources and private ones, and whether they are stored on your own computer or accessed online. You'll walk away knowing how to setup a well balanced RAG system using Java and the best performing and/or cheapest LLM.

Let’s use IntelliJ as a game engine, just because we canAlexander Chatzizacharias
Devoxx Greece 2024 - Small steps are the fastest way forwardSander Hoogendoorn
A fun and absurd introduction to Vector DatabasesAlexander Chatzizacharias
Devoxx Greece 2024 - The lost art of software designSimon Brown
The Era of AAP: Ai Augmented Programming using only JavaStephan Janssen
From k9s to OpenTelemetry: A guide to observability for your apps in K8sMatthias Haeussle

Crafting your own RAG system: Leveraging 30+ LLMs for enhanced performance

Speaker

About this talk

More talks to watch