Llm articles

10/22/2023 • EN

Ollama - Building a Custom Model

A guide on using Ollama's Modelfile to create and deploy a custom large language model (LLM) for specific tasks, like an API security assistant.

API Security Custom Model llm Modelfile Ollama

Unmesh Gundecha

10/15/2023 • EN

Reflections on AI Engineer Summit 2023

Key takeaways from the AI Engineer Summit 2023, focusing on challenges in LLM deployment like evaluation methods and serving costs.

AI Engineering deployment Eval llm Serving Costs

Eugene Yan

10/14/2023 • EN

Ollama - running large language models on your machine

A guide to using Ollama, an open-source CLI tool for running and customizing large language models like Llama 2 locally on your own machine.

command line llm Local AI Ollama Transformer

Unmesh Gundecha

10/10/2023 • EN

Multimodality and Large Multimodal Models (LMMs)

An in-depth exploration of Large Multimodal Models (LMMs), covering their fundamentals, key architectures like CLIP and Flamingo, and current research directions.

Clip Flamingo Large Multimodal Models llm Multimodal AI

Chip Huyen

10/9/2023 • EN

AI Engineer 2023 Keynote - Building Blocks for LLM Systems

A summary of a keynote talk on essential building blocks for production LLM systems, covering evaluations, RAG, and guardrails.

AI Engineering Evaluations llm production Retrieval Augmented Generation

Eugene Yan

9/24/2023 • EN

TWIL: September 24, 2023

A developer's weekly learning log covering Azure Machine Learning, Prompt Flow, Microsoft Fabric, Copilot, and an LLM hallucination paper.

Azure Machine Learning llm Microsoft Copilot Microsoft Fabric Prompt Flow

André Vala

9/20/2023 • EN

LLMs Demand Observability-Driven Development

Explains why traditional debugging fails for LLMs and advocates for observability-driven development to manage their non-deterministic nature in production.

debugging llm observability Production Systems software development

Charity Majors

9/15/2023 • EN

Optimizing LLMs From a Dataset Perspective

Strategies for improving LLM performance through dataset-centric fine-tuning, focusing on instruction datasets rather than model architecture changes.

Dataset Finetuning Instruction Tuning llm Neural Networks

Sebastian Raschka

9/15/2023 • EN

Optimizing LLMs From a Dataset Perspective

Explores dataset-centric strategies for fine-tuning LLMs, focusing on instruction datasets to improve model performance without altering architecture.

Dataset Finetuning Instruction Tuning llm Neural Networks

Sebastian Raschka

9/8/2023 • EN

Asking a Large Language Model How YouTube Works

A technical guide on using an LLM (Platypus2) with LangChain and pgvector to analyze YouTube's Procella database paper.

Langchain Llamacpp llm Pgvector postgresql

Mark Litwintschik

8/31/2023 • EN

Optimize open LLMs using GPTQ and Hugging Face Optimum

A guide to using GPTQ quantization with Hugging Face Optimum to compress open-source LLMs for efficient deployment on smaller hardware.

Gptq Hugging Face llm Optimum Quantization

Philipp Schmid

8/29/2023 • EN

AI crap

A critical analysis of the machine learning bubble, arguing its lasting impact will be a proliferation of low-quality, automated content and services, not true AGI.

Agi AI Bubble automation llm Machine Learning

Drew DeVault

8/27/2023 • EN

TWIL: August 27, 2023

A developer's weekly learning log covering Power BI data refresh, LLM architectures, Azure OpenAI costs, AI news, Python in Excel, and Azure SQL updates.

Azure Azure Openai Data Refresh llm Power Bi

André Vala

8/10/2023 • EN

The NeurIPS 2023 LLM Efficiency Challenge Starter Guide

A guide to participating in the NeurIPS 2023 LLM Efficiency Challenge, focusing on efficient fine-tuning of large language models on a single GPU.

Efficient Training Finetuning Gpu llm Neural Networks

Sebastian Raschka

8/3/2023 • EN

Introducing EasyLLM - streamline open LLMs

Introduces EasyLLM, an open-source Python package for streamlining work with open large language models via OpenAI-compatible clients.

Huggingface llm open source Openai API Python

Philipp Schmid

8/3/2023 • EN

Summer Side Projects: From Academic Insights to Movie Nights

A developer shares two summer side projects: an academic paper digest app and a movie selection tool for groups, built to solve personal problems.

api JavaScript llm side projects Web Development

John Deliyiannis

7/31/2023 • EN

TWIL: July 31, 2023

Weekly tech digest covering Azure OpenAI architecture, vector databases, AI anomaly detection, and an LLM self-cloning article.

AI Anomaly Detector Azure Openai Landing Zone llm Vector Databases

André Vala

7/30/2023 • EN

Patterns for Building LLM-based Systems & Products

A practical guide outlining seven key patterns for integrating Large Language Models (LLMs) into robust, production-ready systems and products.

caching Fine Tuning Guardrails llm Rag

Eugene Yan

7/23/2023 • EN

TWIL: July 23, 2023

Weekly tech roundup covering major Microsoft AI announcements: Bing Chat Enterprise, Microsoft 365 Copilot pricing, Azure AI updates, and new LLM architectures.

Azure AI Function Calling generative ai llm Openai

André Vala