Integrating Long-Term Memory with Gemini 2.5
A guide to adding long-term memory to a Gemini 2.5 chatbot using the Mem0 library and vector databases for personalized AI interactions.
Philipp Schmid is a Staff Engineer at Google DeepMind, where he builds AI Developer Experience and leads DevRel initiatives. He specializes in LLMs, RLHF, and making advanced AI accessible to developers worldwide.
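To make the pattern concrete before diving in, here is a minimal sketch of the loop the guide builds up: retrieve relevant memories with Mem0, ground the Gemini 2.5 call in them, then store the new exchange. It assumes the `mem0ai` and `google-genai` packages and a `GOOGLE_API_KEY` in the environment; the exact shape of `Memory.search` results and Mem0's default backend vary across versions, so treat this as an illustration of the idea rather than the article's exact code.

```python
# Illustrative sketch of the retrieve -> generate -> store memory loop.
# Assumes: `pip install mem0ai google-genai` and GOOGLE_API_KEY set.
# Memory() uses Mem0's defaults; Memory.from_config({...}) can point the
# internal LLM/embedder and vector store at Gemini instead (see Mem0's docs).
import os

from mem0 import Memory
from google import genai

client = genai.Client(api_key=os.environ["GOOGLE_API_KEY"])
memory = Memory()

def chat(user_id: str, message: str) -> str:
    # 1. Retrieve memories relevant to this message, scoped to this user.
    hits = memory.search(query=message, user_id=user_id)
    facts = "\n".join(h["memory"] for h in hits["results"])

    # 2. Ground the Gemini 2.5 call in the retrieved memories.
    prompt = f"Relevant facts about the user:\n{facts}\n\nUser: {message}"
    response = client.models.generate_content(
        model="gemini-2.5-flash", contents=prompt
    )

    # 3. Persist the exchange so future turns can recall it.
    memory.add(
        [
            {"role": "user", "content": message},
            {"role": "assistant", "content": response.text},
        ],
        user_id=user_id,
    )
    return response.text
```

The `user_id` scoping is what turns a shared vector store into per-user long-term memory: every retrieval and write is filtered to a single user, so the chatbot personalizes without leaking one user's facts into another's conversation.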