Understanding RAG @ All Things Open AI 2025

What I learned while creating Bookshelf (open source RAG Application)

I had the opportunity to speak at the AllThingsOpen.ai 2025 Conference recently. It was a wonderful experience to learn from and share with so many amazing fellow presenters and attendees. All Things Open AI 2025 session recordings are available now.

I spoke about several key aspects of Retrieval-Augmented Generation (RAG) and vector databases. The main topics I addressed included The Core Concepts of RAG, Embedding Models vs. Inference Models, Handling Multiple Embeddings and various Optimizations Beyond Naive RAG such as chunking strategies, enrichment with metadata, auto-merging retriever, and reranking models. Finally, I touched on Local Execution and GPU Utilization.

You can watch the recording here: YouTube – How I created Bookshelf: An Open Source AI-Powered Personal Knowledge Base – Ash Tewari

The slides can be downloaded from here: (Google Drive Link)

References and Links:

AI Powered Bookshelf – Blog Post
HuggingFace – MTEB Leaderboard
PyTorch – Local Execution
Medium – A Brief Introduction to Retrieval Augmented Generation(RAG)
Microsoft – RAG Architecture
Amazon – Retrieval Augmented Generation