Pre Training Articles

8/17/2024 • EN

New LLM Pre-training and Post-training Paradigms

Analyzes the latest pre-training and post-training methodologies used in state-of-the-art LLMs like Qwen 2, Apple's models, Gemma 2, and Llama 3.1.

Fine Tuning, Language Models, llm, Post Training, Pre Training

Sebastian Raschka

8/17/2024 • EN

New LLM Pre-training and Post-training Paradigms

A technical review of the latest pre-training and post-training methodologies used in state-of-the-art large language models (LLMs) like Qwen 2 and Llama 3.1.

ai, large language models, llm, Post Training, Pre Training

Sebastian Raschka

7/7/2024 • EN

Extrinsic Hallucinations in LLMs

Explores the causes and types of hallucinations in large language models, focusing on extrinsic hallucinations and how training data affects factual accuracy.

Factuality, Fine Tuning, Hallucination, llm, Pre Training

Lilian Weng