Get $150 in Free GPU Credits
Purchase the book and receive $150 in free GPU credits on Lambda. Simply email your proof of purchase to author@thelmbook.com to claim your credits.
Large language models (LLMs) have fundamentally transformed how machines process and generate information. They are reshaping white-collar jobs at a pace comparable only to the revolutionary impact of personal computers. Understanding the mathematical foundations and inner workings of language models has become crucial for maintaining relevance and competitiveness in an increasingly automated workforce.
This book guides you through the evolution of language models, starting from machine learning fundamentals. Rather presenting transformers right away, which can feel overwhelming, we build understanding of language models step by step—from simple count-based methods through recurrent neural networks to modern architectures. Each concept is grounded in clear mathematical foundations and illustrated with working Python code.
In the largest chapter on large language models, you'll learn both effective prompt engineering techniques and how to finetune these models to follow arbitrary instructions. Through hands-on experience, you'll master proven strategies for getting consistent outputs and adapting models to your needs.
What's inside?
Is the book for you?
Whether you're a technical leader, engineering manager, software developer, data scientist, or machine learning engineer, this book provides both the theoretical depth and practical implementation skills essential for working with language models.
Purchase the book and receive $150 in free GPU credits on Lambda. Simply email your proof of purchase to author@thelmbook.com to claim your credits.
This book is published on the read-first, buy-later principle. All chapters will always remain available on this website.
Andriy Burkov is the author of "The Hundred-Page Machine Learning Book" and "Machine Learning Engineering," both of which became #1 Best Sellers on Amazon. He holds a Ph.D. in Artificial Intelligence and is a recognized expert in machine learning and natural language processing.
As a machine learning expert and leader, Andriy has successfully led dozens of production-grade AI projects in different business domains at Fujitsu and Gartner. His previous books have been translated into more than a dozen languages and are used as textbooks in many universities worldwide. His work has impacted millions of machine learning practitioners and researchers worldwide.
Currently, Andriy is the Head of Machine Learning at TalentNeuron, where he develops AI solutions for talent marketplace analytics. He uses language models and other machine learning tools to analyze billions of job postings across 30+ languages in near real time.
Stay in touch: LinkedIn, X, email, newsletter