29,99 €
Serving patterns enable data science and ML teams to bring their models to production. Most ML models are not deployed for consumers, so ML engineers need to know the critical steps for how to serve an ML model.
This book will cover the whole process, from the basic concepts like stateful and stateless serving to the advantages and challenges of each. Batch, real-time, and continuous model serving techniques will also be covered in detail. Later chapters will give detailed examples of keyed prediction techniques and ensemble patterns. Valuable associated technologies like TensorFlow severing, BentoML, and RayServe will also be discussed, making sure that you have a good understanding of the most important methods and techniques in model serving. Later, you’ll cover topics such as monitoring and performance optimization, as well as strategies for managing model drift and handling updates and versioning. The book will provide practical guidance and best practices for ensuring that your model serving pipeline is robust, scalable, and reliable. Additionally, this book will explore the use of cloud-based platforms and services for model serving using AWS SageMaker with the help of detailed examples.
By the end of this book, you'll be able to save and serve your model using state-of-the-art techniques.
Das E-Book können Sie in Legimi-Apps oder einer beliebigen App lesen, die das folgende Format unterstützen:
Seitenzahl: 419
Veröffentlichungsjahr: 2022
A definitive guide to deploying, monitoring, and providing accessibility to ML models in production
Md Johirul Islam
BIRMINGHAM—MUMBAI
Copyright © 2022 Packt Publishing
All rights reserved. No part of this book may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, without the prior written permission of the publisher, except in the case of brief quotations embedded in critical articles or reviews.
Every effort has been made in the preparation of this book to ensure the accuracy of the information presented. However, the information contained in this book is sold without warranty, either express or implied. Neither the author , nor Packt Publishing or its dealers and distributors, will be held liable for any damages caused or alleged to have been caused directly or indirectly by this book.
Packt Publishing has endeavored to provide trademark information about all of the companies and products mentioned in this book by the appropriate use of capitals. However, Packt Publishing cannot guarantee the accuracy of this information.
Group Product Manager: Ali Abidi
Publishing Product Manager: Ali Abidi
Senior Editor: Tiksha Lad
Technical Editor: Rahul Limbachiya
Copy Editor: Safis Editing
Project Coordinator: Aparna Ravikumar Nair
Proofreader: Safis Editing
Indexer: Pratik Shirodkar
Production Designer: Shyam Sundar Korumilli
Marketing Coordinator: Shifa Ansari
First published: December 2022
Production reference: 3060223
Published by Packt Publishing Ltd.
Livery Place
35 Livery Street
Birmingham
B3 2PB, UK.
ISBN 978-1-80324-990-2
www.packtpub.com
Md Johirul Islam is a data scientist and engineer at Amazon. He has a PhD in computer science and is also an adjunct professor at Purdue University. His expertise is focused on designing explainable, maintainable, and robust data science pipelines, applying software design principles, and helping organizations deploy machine learning models into production at scale.
Quan V. Dang is a machine learning engineer with experience in various domains, including finance, e-commerce, and logistics. He started his professional career as a researcher at the University of Aizu, where he mainly worked on classical machine learning and evolutionary algorithms. After graduating from university, he shifted his focus to deploying AI products and managing machine learning infrastructure. He is also the founder of the MLOps VN community, with over 5,000 people discussing MLOps and machine learning engineering. In his free time, he often writes technical blogs, hangs out, and goes traveling.
In this part, we will give an overview of model serving and explain why it is a challenge. We will also introduce you to a naive approach for serving models and discuss its drawbacks.
This section contains the following chapters:
Chapter 1, Introducing Model ServingChapter 2, Introducing Model Serving Patterns