Small Language Models for Edge AI: Deploying Efficient LLMs on IoT and Embedded Devices
Available for immediate download
26,49 €

Description


Big language models are impressive. They're also power-hungry, cloud-dependent, and about as edge-friendly as a data center strapped to a toaster. This book is for everyone who's asked the obvious question: what if we made language models small enough to actually live on devices?
Small Language Models for Edge AI is a practical, motivating, and occasionally sarcastic guide to running efficient language models on IoT and embedded systems, where memory is tiny, power is precious, and latency is not a suggestion. Inside, you'll learn how modern language models actually work (without drowning in theory), then quickly move into the real-world engineering problems that matter: shrinking models, optimizing inference, managing memory, and squeezing performance out of hardware that was never designed to run chatbots. From microcontrollers and SoCs to NPUs and edge accelerators, this book shows how to design models that respect hardware limits instead of ignoring them.
You'll explore proven techniques like quantization, pruning, knowledge distillation, and low-rank adaptation, explained clearly and applied specifically to edge AI. We'll cover toolchains such as TensorFlow Lite and ONNX, walk through efficient deployment pipelines, and discuss how to fine-tune and personalize models without setting your device on fire (thermally or emotionally).
Security, privacy, and reliability are treated as first-class citizens, not afterthoughts. Learn why on-device language models are a win for privacy, how to protect models from extraction, and what it takes to deploy responsibly at scale. Real-world case studies, from smart homes to industrial IoT, wearables, robotics, and automotive systems, ground the concepts in practical experience.
Finally, we'll look ahead: ultra-low-power models, hardware–model co-design, federated learning, and where edge language intelligence is actually going (hint: smaller, smarter, and everywhere).
This book is written for embedded engineers, AI practitioners, system architects, and curious builders who want language models that ship, not just demo. If you're tired of cloud dependency, excited by efficient AI, and ready to make language models work in the real world, this book is for you. Small models. Big impact. Welcome to the edge.

Details

All devices (except Kindle)
Reflowable
9798233969775

Compatibility

Format:

eBooks sold by Feltrinelli.it are in ePub format and may be protected by Adobe DRM. Downloading a DRM-protected file yields an .acsm file (Adobe Content Server Message), which must be opened with Adobe Digital Editions and authorized with an Adobe account before it can be read on a PC or transferred to compatible devices.

Compatibility:

eBooks sold by Feltrinelli.it can be read on any of the following devices: PC, eReader, smartphone, or tablet, or with a Kobo app for iOS or Android.

Cloud:

eBooks sold by Feltrinelli.it are synchronized automatically across all Kobo reading clients after purchase. Thanks to the Kobo Cloud, reading progress, notes, and highlights are saved and synchronized automatically across all the devices and Kobo reading apps you use.
