Retrieval Augmented Generation articles

11/7/2025 • EN

Gemini API File Search: A Web Developer Tutorial

A tutorial on using the Gemini API's File Search feature for RAG in web development with JavaScript/TypeScript.

File Search Gemini API JavaScript Retrieval Augmented Generation SDK

Philipp Schmid

9/12/2025 • EN

Stumbling into AI: Part 3—RAG

Explains Retrieval-Augmented Generation (RAG), a pattern for improving LLM accuracy by augmenting prompts with retrieved context.

ai Apache Kafka llm Rag Retrieval Augmented Generation

Robin Moffatt

7/15/2025 • EN

Long-Term Memory for AI: How Graphiti Works for Building Real Smart Applications

Introduces Graphiti, an open-source framework for building bi-temporal knowledge graphs to give AI agents long-term memory and real-time data understanding.

AI Agents Graphiti Knowledge Graphs Python Retrieval Augmented Generation

Samuel Fajreldines

7/3/2025 • EN

AI Repo of the Week: Generative AI for Beginners with JavaScript

A hands-on guide for JavaScript developers to learn Generative AI and LLMs through interactive lessons, projects, and a companion app.

generative ai JavaScript large language models prompt engineering Retrieval Augmented Generation

Code with Dan

4/8/2025 • EN

A Journey from AI to LLMs and MCP - 4 - What Are AI Agents — And Why They're the Future of LLM Applications

Explores AI agents, their core components, differences from LLMs, and real-world applications, positioning them as the future of autonomous AI systems.

AI Agents Autonomous Systems LLM Applications Rag Retrieval Augmented Generation

Alex Merced

4/7/2025 • EN

A Journey from AI to LLMs and MCP - 3 - Boosting LLM Performance — Fine-Tuning, Prompt Engineering, and RAG

Explores three key methods to enhance LLM performance: fine-tuning, prompt engineering, and RAG, detailing their use cases and trade-offs.

ai Fine Tuning llm prompt engineering Retrieval Augmented Generation

Alex Merced

3/18/2025 • EN

Enhancing Text-to-SQL With Synthetic Summaries

Explains a technique using AI-generated summaries of SQL queries to improve the accuracy of text-to-SQL systems with LLMs.

llm Retrieval Augmented Generation SQL Generation Synthetic Data Text To SQL

Saeed Esmaili

2/9/2025 • EN

AI Engineering Architecture and User Feedback

Explores AI engineering architecture patterns and user feedback methods, from simple APIs to complex agent-based systems.

AI Architecture LLM Guardrails Model Routing Retrieval Augmented Generation User Feedback

Alex Strick van Linschoten

1/24/2025 • EN

Notes on ‘AI Engineering’ (Chip Huyen) chapter 6

Analysis of Chapter 6 from Chip Huyen's 'AI Engineering' book, focusing on RAG systems and AI agents, their architecture, costs, and relationship.

Agents AI Engineering Context Windows Rag Retrieval Augmented Generation

Alex Strick van Linschoten

8/15/2024 • EN

Discover Azure AI Foundry: Your Gateway to Advanced AI Development

An overview of Azure AI Foundry, a unified platform for building and deploying AI solutions on Microsoft Azure, covering its features and benefits.

ai development Azure AI generative ai Retrieval Augmented Generation Vector Search

Hugo Barona

7/7/2024 • EN

Trying out Microsoft’s Graph RAG

Explores Microsoft's Graph RAG, an advanced RAG technique using knowledge graphs to answer global questions about datasets, with a hands-on setup guide.

Azure AI Search Graph Rag Knowledge Graphs llm Retrieval Augmented Generation

Geert Baeke

6/17/2024 • EN

The limitations of LLMs, or why are we doing RAG?

Explains the limitations of Large Language Models (LLMs) and introduces Retrieval Augmented Generation (RAG) as a solution for incorporating proprietary data.

Chatgpt Gpt 4 llm Rag Retrieval Augmented Generation

Phil Eaton

3/24/2024 • EN

Use Azure OpenAI on your data with Semantic Kernel

Explains how to use Azure OpenAI with your own data via Semantic Kernel, focusing on RAG and Azure AI Search integration.

Azure AI Search Azure Openai Retrieval Augmented Generation Semantic Kernel Vector Search

Geert Baeke

3/15/2024 • EN

Using Azure AI Language studio to improve RAG grounding document discovery

A technical guide on using Azure AI Language Studio to summarize and optimize grounding documents for improving RAG-based AI solutions.

Azure AI Language Studio Document Summarization llm Rag Retrieval Augmented Generation

Benjamin Perkins

12/11/2023 • EN

Retrieval-Augmented Generation (RAG) simply explained

A simple explanation of Retrieval-Augmented Generation (RAG), covering its core components: LLMs, context, and vector databases.

large language models llm Rag Retrieval Augmented Generation Vector Databases

Luc van Donkersgoed

10/9/2023 • EN

AI Engineer 2023 Keynote - Building Blocks for LLM Systems

A summary of a keynote talk on essential building blocks for production LLM systems, covering evaluations, RAG, and guardrails.

AI Engineering Evaluations llm production Retrieval Augmented Generation

Eugene Yan

6/11/2023 • EN

Obsidian-Copilot: An Assistant for Writing & Reflecting

A technical overview of Obsidian-Copilot, a prototype AI assistant for drafting and reflecting within the Obsidian note-taking app using retrieval-augmented generation.

LLM Engineering obsidian Opensearch Retrieval Augmented Generation Semantic Search

Eugene Yan

5/18/2023 • EN

Grounded ChatGPT

Explains Retrieval Augmented Generation (RAG) for using ChatGPT with custom data, including a C# implementation sample.

c Chatgpt docker Elasticsearch Retrieval Augmented Generation

Stephen Cleary

1/3/2022 • EN

The Illustrated Retrieval Transformer

Explains how retrieval-augmented language models like RETRO achieve GPT-3 performance with far fewer parameters by querying external knowledge.

Deepmind large language models Retrieval Augmented Generation Retro Transformer Architecture

Jay Alammar

Retrieval Augmented Generation Articles