RAG articles

6/25/2024 • EN

Train and Deploy open Embedding Models on Amazon SageMaker

A guide to fine-tuning and deploying custom embedding models for RAG applications on Amazon SageMaker using Sentence Transformers v3.

Amazon Sagemaker Embedding Models Hugging Face Rag Sentence Transformers

Philipp Schmid

6/18/2024 • EN

Full Local RAG scenario using #Phi3, #SemanticKernel and TextMemory. Bonus: Test in CodeSpaces

A tutorial on implementing a local RAG system using Phi-3, Semantic Kernel, and TextMemory in a C# console application.

C# Phi 3 Rag Semantic Kernel Text Memory

Bruno Capuano

6/17/2024 • EN

The limitations of LLMs, or why are we doing RAG?

Explains the limitations of Large Language Models (LLMs) and introduces Retrieval Augmented Generation (RAG) as a solution for incorporating proprietary data.

Chatgpt Gpt 4 llm Rag Retrieval Augmented Generation

Phil Eaton

6/4/2024 • EN

Fine-tune Embedding models for Retrieval Augmented Generation (RAG)

A guide to fine-tuning embedding models for RAG applications using Sentence Transformers 3, featuring Matryoshka Representation Learning for efficiency.

Embedding Models Fine Tuning Matryoshka Representation Learning Rag Sentence Transformers

Philipp Schmid

6/2/2024 • EN

To Chunk or Not to Chunk With the Long Context Single Embedding Models

An experiment comparing retrieval performance of chunked vs. non-chunked documents using long-context embedding models like BGE-M3.

Chunking Context Window Embeddings Rag Retrieval

Saeed Esmaili

5/12/2024 • EN

What We've Learned From A Year of Building with LLMs

A practical guide sharing lessons learned from a year of building real-world applications with Large Language Models (LLMs).

AI Evals large language models LLM Applications prompt engineering Rag

Eugene Yan

4/6/2024 • EN

Building a RAG for tabular data in Go with PostgreSQL & Gemini

A technical guide on building a Retrieval-Augmented Generation (RAG) system in Go to query PostgreSQL tabular data using Google's Gemini LLM.

Gemini go postgresql Rag Vertex AI

Paolo Galeone

3/15/2024 • EN

Using Azure AI Language studio to improve RAG grounding document discovery

A technical guide on using Azure AI Language Studio to summarize and optimize grounding documents for improving RAG-based AI solutions.

Azure AI Language Studio Document Summarization llm Rag Retrieval Augmented Generation

Benjamin Perkins

2/10/2024 • EN

Retrieval with the Azure OpenAI Assistants API

Explains how to implement document retrieval with the Azure OpenAI Assistants API using a custom RAG approach, as the retrieval tool is not yet natively supported.

Assistants API Azure Openai Rag Retrieval Vector Storage

Geert Baeke

12/11/2023 • EN

Retrieval-Augmented Generation (RAG) simply explained

A simple explanation of Retrieval-Augmented Generation (RAG), covering its core components: LLMs, context, and vector databases.

large language models llm Rag Retrieval Augmented Generation Vector Databases

Luc van Donkersgoed

10/30/2023 • EN

Evaluate LLMs and RAG a practical example using Langchain and Hugging Face

A hands-on guide to evaluating LLMs and RAG systems using Langchain and Hugging Face, covering criteria-based and pairwise evaluation methods.

Gpt 4 Hugging Face Langchain LLM Evaluation Rag

Philipp Schmid

8/13/2023 • EN

How to Match LLM Patterns to Problems

A guide to selecting the right LLM architectural patterns (like RAG, fine-tuning, caching) to solve common production challenges, such as meeting performance metrics and working within data constraints.

Fine Tuning LLM Applications LLM Patterns LLM Production Rag

Eugene Yan