Llm articles

5/12/2024 • EN

How Good Are the Latest Open LLMs? And Is DPO Better Than PPO?

A technical review of April 2024's major open LLM releases (Mixtral, Llama 3, Phi-3, OpenELM) and a comparison of DPO vs PPO for LLM alignment.

Dpo llm Ppo Reinforcement Learning Transformer

Sebastian Raschka

5/12/2024 • EN

How Good Are the Latest Open LLMs? And Is DPO Better Than PPO?

A review and comparison of the latest open LLMs (Mixtral, Llama 3, Phi-3, OpenELM) and a study on DPO vs. PPO for LLM alignment.

llm Mixture Of Experts Ppo Reinforcement Learning Transformer

Sebastian Raschka

3/18/2024 • EN

LLMs Shouldn't Write SQL

Argues against using LLMs to generate SQL queries for novel business questions, highlighting the importance of human analysts for precision.

data analysis Data Modeling llm Query Generation sql

Saeed Esmaili

3/15/2024 • EN

Using Azure AI Language studio to improve RAG grounding document discovery

A technical guide on using Azure AI Language Studio to summarize and optimize grounding documents for improving RAG-based AI solutions.

Azure AI Language Studio Document Summarization llm Rag Retrieval Augmented Generation

Benjamin Perkins

3/12/2024 • EN

Optimizing Technical Docs for LLMs

Practical tips for writing technical documentation that is optimized for LLM question-answering tools, improving developer experience.

API Documentation Code Snippets developer experience llm Technical Documentation

Saeed Esmaili

3/3/2024 • EN

Research Papers in February 2024

A summary of February 2024 AI research, covering new open-source LLMs like OLMo and Gemma, and a study on small, fine-tuned models for text summarization.

AI Research Finetuning llm open source Summarization

Sebastian Raschka

3/3/2024 • EN

Research Papers in February 2024

A summary of key AI research papers from February 2024, focusing on new open-source LLMs, small fine-tuned models, and efficient fine-tuning techniques.

AI Research Finetuning Gemma llm open source

Sebastian Raschka

2/20/2024 • EN

There Is a Huge Gap in Generative Ai

Explores the gap between generative AI's perceived quality in open-ended play and its practical effectiveness for specific, goal-oriented tasks.

ai development generative ai llm Machine Learning software engineering

Saeed Esmaili

2/13/2024 • EN

I worry our Copilot is leaving some passengers behind

A developer's critical reflection on GitHub Copilot's impact, questioning if its AI assistance is creating accessibility and quality divides in software development.

AI Coding Assistant developer tools Github Copilot llm software development

Josh Collinsworth

2/11/2024 • EN

How to Generate and Use Synthetic Data for Finetuning

Explores methods for generating synthetic data (distillation & self-improvement) to fine-tune LLMs for pretraining, instruction-tuning, and preference-tuning.

Finetuning Instruction Tuning llm Preference Tuning Synthetic Data

Eugene Yan

1/25/2024 • EN

Running a local LLM with Ollama

A guide on running a Large Language Model (LLM) locally using Ollama for privacy and offline use, covering setup and performance tips.

llm Local AI Model Deployment Ollama privacy

Jan Ouwens

1/23/2024 • EN

RLHF in 2024 with DPO and Hugging Face

A technical guide on using Direct Preference Optimization (DPO) with Hugging Face's TRL library to align and improve open-source large language models in 2024.

Dpo Hugging Face llm Rlhf Trl

Philipp Schmid

1/7/2024 • EN

Language Modeling Reading List (to Start Your Paper Club)

A curated reading list of fundamental language modeling papers with summaries, designed to help start a weekly paper club for learning and discussion.

Language Modeling llm Paper Club Research Transformer

Eugene Yan

1/3/2024 • EN

Is the ChatGPT API Refusing to Summarize Academic Papers? Not so fast.

Investigates why ChatGPT 3.5 API sometimes refuses to summarize arXiv papers, exploring prompts, content, and model behavior.

Academic Papers Chatgpt API llm Openai API Summarization

Matt Mazur

12/21/2023 • EN

LLM Model Serving on Autopilot

A guide to deploying and running your own LLM on Google Kubernetes Engine (GKE) Autopilot for control, privacy, and cost management.

Autopilot Gke Kubernetes llm Model Serving

William Denniss

12/11/2023 • EN

Retrieval-Augmented Generation (RAG) simply explained

A simple explanation of Retrieval-Augmented Generation (RAG), covering its core components: LLMs, context, and vector databases.

large language models llm Rag Retrieval Augmented Generation Vector Databases

Luc van Donkersgoed

11/30/2023 • EN

LLM-Supported Development

A developer's experience using Sweep, an LLM-powered tool that generates pull requests to write unit tests and fix code in a GitHub workflow.

ai coding github llm software development unit testing

James Smith

11/14/2023 • EN

Exploring ChatGPT’s Knowledge Cutoff

An analysis of ChatGPT's knowledge cutoff date, testing its accuracy on celebrity death dates to understand the limits of its training data.

api Chatgpt Gpt 4 Knowledge Cutoff llm

Matt Mazur

11/5/2023 • EN

Out-of-Domain Finetuning to Bootstrap Hallucination Detection

Explores using out-of-domain data to improve LLM finetuning for detecting factual inconsistencies (hallucinations) in text summaries.

Finetuning Hallucination Detection llm Machine Learning Natural Language Inference

Eugene Yan

10/25/2023 • EN

Adversarial Attacks on LLMs

Explores adversarial attacks and jailbreak prompts that can make large language models produce unsafe or undesired outputs, bypassing safety measures.

Adversarial Attacks Jailbreak Prompts large language models llm security

Lilian Weng