Andriy Burkov
THE HUNDRED-PAGE
LANGUAGE MODELS
BOOK

About the Book

The Hundred-Page Language Models Book

Large language models (LLMs) have fundamentally transformed how machines process and generate information. They are reshaping white-collar jobs at a pace comparable only to the revolutionary impact of personal computers. Understanding the mathematical foundations and inner workings of language models has become crucial for maintaining relevance and competitiveness in an increasingly automated workforce.

This book guides you through the evolution of language models, starting from machine learning fundamentals. Rather than presenting transformers right away, which can feel overwhelming, we build an understanding of language models step by step, from simple count-based methods through recurrent neural networks to modern architectures. Each concept is grounded in clear mathematical foundations and illustrated with working Python code.

In the book's largest chapter, which covers large language models, you'll learn both effective prompt engineering techniques and how to finetune these models to follow arbitrary instructions. Through hands-on experience, you'll master proven strategies for getting consistent outputs and adapting models to your needs.

What's inside?

  • Mathematical foundations with intuitive explanations
  • Complete Python implementations with PyTorch on GitHub
  • Natural progression from simple models to transformers
  • Practical Jupyter notebooks for each topic
  • Theory, illustrations, and code combined in each chapter
  • $150 in free GPU credits on Lambda

Is the book for you?

Whether you're a technical leader, engineering manager, software developer, data scientist, or machine learning engineer, this book provides both the theoretical depth and practical implementation skills essential for working with language models.




Get $150 in Free GPU Credits

Purchase the book and receive $150 in free GPU credits on Lambda. Simply email your proof of purchase to author@thelmbook.com to claim your credits.

Buy the Book

Hardcover
Buy for $55
Paperback
Buy for $47
When you buy a hard copy, you can request a free PDF copy.

Chapters

This book is published on the read-first, buy-later principle. All chapters will always remain available on this website.

About the Author

Andriy Burkov

Andriy Burkov is the author of "The Hundred-Page Machine Learning Book" and "Machine Learning Engineering," both of which became #1 Best Sellers on Amazon. He holds a Ph.D. in Artificial Intelligence and is a recognized expert in machine learning and natural language processing.

As a machine learning expert and leader, Andriy has successfully led dozens of production-grade AI projects in different business domains at Fujitsu and Gartner. His previous books have been translated into more than a dozen languages and are used as textbooks in many universities worldwide. His work has impacted millions of machine learning practitioners and researchers.

Currently, Andriy is the Head of Machine Learning at TalentNeuron, where he develops AI solutions for talent marketplace analytics. He uses language models and other machine learning tools to analyze billions of job postings across 30+ languages in near real time.

Stay in touch: LinkedIn, X, email, newsletter

