Llm articles

6/23/2023 • EN

LLM Powered Autonomous Agents

An overview of LLM-powered autonomous agents, covering their core components like planning, memory, and tool use for complex problem-solving.

autonomous agents llm Memory Planning Tool Use

Lilian Weng

6/15/2023 • EN

Running Large Language Models locally – Your own ChatGPT-like AI in C#

A guide to running open-source Large Language Models (LLMs) like LLaMA locally on your CPU using C# and the LLamaSharp library.

c llm Local Inference Machine Learning open source

Maarten Balliauw

6/14/2023 • EN

Finetuning Falcon LLMs More Efficiently With LoRA and Adapters

A guide to efficiently finetuning Falcon LLMs using parameter-efficient methods like LoRA and adapters to reduce compute costs.

Adapters Falcon Finetuning llm Lora

Sebastian Raschka

6/8/2023 • EN

We may finally crack Maths. But should we?

Explores the potential and implications of using AI to automate mathematical theorem proving, framing it as a 'tame' problem solvable by machines.

artificial intelligence llm mathematics search algorithms Theorem Proving

Ferenc Huszár

5/31/2023 • EN

Genesis 1 but every word begins with 'A' - with GPT4

An AI-generated, alliterative rewrite of Genesis 1 where every word starts with the letter 'A', created using GPT-4.

ai Constrained Text Generation Gpt 4 llm Natural Language Processing

Piotr Migdał

5/21/2023 • EN

Some Intuition on Attention and the Transformer

Explains the intuition behind the Attention mechanism and Transformer architecture, focusing on solving issues in machine translation and language modeling.

Attention Mechanism Deep Learning llm NLP Transformer

Eugene Yan

4/30/2023 • EN

Interacting with LLMs with Minimal Chat

Explores user interfaces for LLMs that minimize text chat, using clicks and user context for more intuitive interactions.

llm NLP recommendation systems ui-design user experience

Eugene Yan

4/24/2023 • EN

Prompt Engineering is for Transactional Prompting

The article distinguishes between interactive and transactional prompting, arguing that prompt engineering is most valuable for transactional, objective tasks with LLMs.

Interactive Prompting Language Models llm prompt engineering Transactional Prompting

Mitchell Hashimoto

4/16/2023 • EN

Raspberry-LLM - Making My Raspberry Pico a Little Smarter

A developer explores running LLMs on a Raspberry Pi Pico with memory constraints, creating a witty e-ink display that generates content from news feeds.

Embedded Systems iot llm Micro Python raspberry pi

Eugene Yan

4/11/2023 • EN

Cryptos and LLMs

A software engineer compares the hype around cryptocurrencies and LLMs, arguing that LLMs provide tangible value while crypto is plagued by scams.

ai Cryptocurrency generative ai llm startups

Rui Peres

4/9/2023 • EN

Experimenting with LLMs to Research, Reflect, and Plan

A developer shares experiments building LLM-powered tools for research, reflection, and planning, including URL summarizers, SQL agents, and advisory boards.

Agents AI Assistant Langchain llm Retrieval

Eugene Yan

3/30/2023 • EN

Autoregressive Models, OOD Prompts and the Interpolation Regime

Explores autoregressive models, their relationship to joint distributions, and how they handle out-of-distribution prompts, with insights relevant to LLMs.

Autoregressive Models Generative Modeling Inductive Biases llm Machine Learning

Ferenc Huszár