Hands-On LLM Serving and Optimization

Hosting LLMs at Scale

Description

As the demand for real-time AI applications grows, along comes this comprehensive guide to the complexities of deploying and optimizing LLMs at scale. The authors take a real-world approach backed by practical examples and code, and assemble essential strategies for designing infrastructures that are equal to the demands of modern AI applications.
Free shipping from
€ 19,95 within The Netherlands
Writer
Wang, Chi, Hu, Peiheng
Title
Hands-On LLM Serving and Optimization
Publisher
O'Reilly Media
Year
2026
Language
English
Pages
300
EAN
9798341621497
Binding format
Paperback

You will always receive the last edition from us!


Categories

Boekstra