Hands-On LLM Serving and Optimization

Hosting LLMs at Scale

Description

As demand for real-time AI applications grows, this comprehensive guide addresses the complexities of deploying and optimizing LLMs at scale. The authors take a real-world approach backed by practical examples and code, and present essential strategies for designing infrastructure that can meet the demands of modern AI applications.

Authors: Wang, Chi; Hu, Peiheng
Title: Hands-On LLM Serving and Optimization
Publisher: O'Reilly Media

Year: 2026
Language: English
Pages: 300

EAN: 9798341621497
Binding format: Paperback

You will always receive the latest edition from us!
