AI Inference Optimization Engineering

Name: AI Inference Optimization Engineering
Brand: Independently published
SKU: 52770465
Price: 11.41 EUR
Availability: InStock
Author: ChatVariety Team
ISBN: 9798199720021

Quantization, Speculative Decoding, and Hardware-Specific LLM Deployment

ChatVariety Team

Language: English

Format: Book, paperback

Libristo code: 52770465

Publisher: Independently published, June 2026


Condition: New (expected)

Expected in stock on 07.06.2026

Returns accepted within 30 days

Slash LLM Deployment Costs and Latency

Deploying Large Language Models (LLMs) in production is a massive economic and engineering hurdle. AI Inference Optimization Engineering is your comprehensive, hands-on guide to mastering the full stack of modern LLM optimization techniques. From memory-bandwidth solutions to hardware-specific compilation, this book bridges the gap between research-level models and enterprise-grade execution.

What you will master inside this book:

Hardware-Aware Optimization: Dive deep into KV cache mechanics, autoregressive decoding, and GPU memory hierarchies to eliminate latency bottlenecks.
State-of-the-Art Quantization: Apply GPTQ and AWQ quantization and the GGUF model format to scale down massive neural networks while preserving model accuracy.
Advanced Acceleration Methods: Implement speculative decoding with draft models or draft heads (like Medusa and EAGLE), PagedAttention, and FlashAttention to boost throughput by 2-3x.
Production-Grade Serving: Build ultra-low-latency deployment infrastructures using vLLM, Triton Inference Server, and continuous batching.
Cross-Platform Deployment: Optimize models for specific target hardware, including NVIDIA H100 (TensorRT-LLM), Apple Silicon (llama.cpp/Metal), and Qualcomm mobile/edge accelerators.

Whether you are an ML infrastructure engineer, an AI platform architect, or a technical leader looking to scale LLMs cost-effectively, this book provides the production-ready code, equations, and architectural patterns you need to build hyper-efficient AI pipelines.

Information about the book

Full title: AI Inference Optimization Engineering
Author: ChatVariety Team
Language: English
Binding: Book, paperback
Publication date: 2026
Number of pages: 96
EAN: 9798199720021
Libristo code: 52770465
Publisher: Independently published
Weight: 142
Dimensions: 152 x 229 x 5

Categories

Computing and information technology > Computer science > Artificial intelligence (AI) > Natural language and machine translation
