Embeddings articles

12/19/2025 • EN

Vector Search in Oracle Database 26ai

A technical guide on implementing vector search in Oracle Database 26ai, using a car manual as a practical example to improve semantic search.

ai Embeddings Oracle Database sql Vector Search

Kellyn Gorman

12/19/2025 • EN

Sam Rose explains how LLMs work with a visual essay

A visual essay explaining LLM internals like tokenization, embeddings, and transformer architecture in an accessible way.

Embeddings llm Prompt Caching Tokenization Transformer Architecture

Simon Willison

12/15/2025 • EN

Elephant(s) in the room: Graph neural networks, embeddings, and foundation models in spatial data science

Explores the application of Graph Neural Networks, embeddings, and foundation models to spatial data science, with practical examples in R.

Deep Learning Embeddings Foundation Models Graph Neural Networks Spatial Data Science

Jakub Nowosad

9/29/2025 • EN

Spelungit: When `git log –grep` isn’t enough

Introducing Spelungit, a semantic search tool for Git commit history that uses natural language queries instead of exact keywords.

Claude Code Commit History Embeddings git Semantic Search

Phil Haack

9/6/2025 • EN

Building Semantic Search with Amazon S3 Vectors and Semantic Kernel

A guide to implementing semantic search for static websites using Amazon S3 Vector Buckets and Microsoft's Semantic Kernel for embedding generation.

Amazon Bedrock Amazon S3 Embeddings Semantic Search Vector Database

Milan Jovanović

4/6/2025 • EN

A Journey from AI to LLMs and MCP - 2 - How LLMs Work — Embeddings, Vectors, and Context Windows

Explains how LLMs work by converting words to numerical embeddings, using vector spaces for semantic understanding, and managing context windows.

Context Windows Embeddings llm Transformers Vectors

Alex Merced

6/2/2024 • EN

To Chunk or Not to Chunk With the Long Context Single Embedding Models

An experiment comparing retrieval performance of chunked vs. non-chunked documents using long-context embedding models like BGE-M3.

Chunking Context Window Embeddings Rag Retrieval

Saeed Esmaili

11/30/2023 • EN

Finding images with text and image queries with the help of GPT-4 Vision

Building an image search system using GPT-4 Vision and Azure AI to find images via text queries or similar pictures.

Azure AI Search computer vision Embeddings Gpt 4 Vision Multimodal Search

Geert Baeke

11/28/2023 • EN

Using Integrated Vectorization in Azure AI Search

Explains how to use Azure AI Search's integrated vectorization for automatic query and field vectorization, with portal and indexer examples.

Azure AI Search Embeddings indexing Openai Vector Search

Geert Baeke

8/10/2023 • EN

The ABCs of AI Transformers, Tokens, and Embeddings: A LEGO Story

Explains AI transformers, tokens, and embeddings using a simple LEGO analogy to demystify how language models process and understand text.

AI Architecture Embeddings Natural Language Processing tokens Transformers

Code with Dan

5/30/2023 • EN

Datacast Episode 117: Vector Databases, The Embeddings Revolution, and Working in China with Frank Liu

Interview with Frank Liu on vector databases, embeddings, his career in ML/hardware, and work culture differences between China and the US.

computer vision Embeddings Hardware Engineering Machine Learning Vector Databases

James Le

3/24/2023 • EN

Enhancing Blog Post Search with Chunk-based Embeddings and Pinecone

Explains a chunk-based embedding method using LangChain and Pinecone to improve blog post search accuracy and efficiency.

Chunking Embeddings Langchain Pinecone Vector Search

Geert Baeke

3/21/2023 • EN

Storing and querying for embeddings with Redis

A guide on using Redis as a vector database to store and query embeddings for semantic search, replacing Pinecone in a tech stack.

Embeddings Openai Redi Vector Database Vector Search

Geert Baeke

3/16/2023 • EN

Pinecone and OpenAI magic: A guide to finding your long lost blog posts with vectorized search and ChatGPT

A technical guide on using Pinecone vector search and OpenAI's API to build a semantic search engine for personal blog posts.

Cosine Similarity Embeddings Openai Pinecone Vector Search

Geert Baeke

3/8/2022 • EN

Creating document embeddings with Hugging Face's Transformers and Amazon SageMaker

Guide to deploying a Sentence Transformers model on Amazon SageMaker for generating document embeddings using Hugging Face's Inference Toolkit.

Amazon Sagemaker Embeddings Hugging Face Inference Transformers

Philipp Schmid

1/21/2019 • EN

Think your Data Different

Explains how Graph Neural Networks and node2vec use graph structure and random walks to generate embeddings for machine learning tasks.

Deep Learning Embeddings Graph Neural Networks Machine Learning Word2vec

Yoel Zeldes

4/8/2018 • EN

Word morphing

Explores word morphing using word2vec embeddings and A* search to find semantic paths between words, like 'tooth' to 'light'.

A Algorithm Embeddings Graph Search Natural Language Processing Word2vec

Yoel Zeldes

Embeddings Articles

Vector Search in Oracle Database 26ai

Sam Rose explains how LLMs work with a visual essay

Elephant(s) in the room: Graph neural networks, embeddings, and foundation models in spatial data science

Spelungit: When `git log –grep` isn’t enough

Building Semantic Search with Amazon S3 Vectors and Semantic Kernel

A Journey from AI to LLMs and MCP - 2 - How LLMs Work — Embeddings, Vectors, and Context Windows

To Chunk or Not to Chunk With the Long Context Single Embedding Models

Finding images with text and image queries with the help of GPT-4 Vision

Using Integrated Vectorization in Azure AI Search

The ABCs of AI Transformers, Tokens, and Embeddings: A LEGO Story

Datacast Episode 117: Vector Databases, The Embeddings Revolution, and Working in China with Frank Liu

Enhancing Blog Post Search with Chunk-based Embeddings and Pinecone

Storing and querying for embeddings with Redis

Pinecone and OpenAI magic: A guide to finding your long lost blog posts with vectorized search and ChatGPT

Creating document embeddings with Hugging Face's Transformers and Amazon SageMaker

Think your Data Different

Word morphing

Select Language