All talks

Crafting your own RAG system: Leveraging 30+ LLMs for enhanced performance

Devoxx 2024Jun 20240 views

Speaker

About this talk

In this talk you'll learn how to set up a RAG (Retrieval-Augmented Generation) system against 30+ different Large Language Models using Java. We'll show you step-by-step how to ingest documents, choose the best text splitter strategies, find similar documents, answer questions, and create a chatbot. Then, we’ll see how to test and compare different AI models, both from open sources and private ones, and whether they are stored on your own computer or accessed online. You'll walk away knowing how to setup a well balanced RAG system using Java and the best performing and/or cheapest LLM.