29,99 €
The Polars Cookbook is a comprehensive, hands-on guide to Python Polars, one of the first resources dedicated to this powerful data processing library. Written by Yuki Kakegawa, a seasoned data analytics consultant who has worked with industry leaders like Microsoft and Stanford Health Care, this book offers targeted, real-world solutions to data processing, manipulation, and analysis challenges. The book also includes a foreword by Marco Gorelli, a core contributor to Polars, ensuring expert insights into Polars' applications.
From installation to advanced data operations, you’ll be guided through data manipulation, advanced querying, and performance optimization techniques. You’ll learn to work with large datasets, conduct sophisticated transformations, leverage powerful features like chaining, and understand its caveats. This book also shows you how to integrate Polars with other Python libraries such as pandas, numpy, and PyArrow, and explore deployment strategies for both on-premises and cloud environments like AWS, BigQuery, GCS, Snowflake, and S3.
With use cases spanning data engineering, time series analysis, statistical analysis, and machine learning, Polars Cookbook provides essential techniques for optimizing and securing your workflows. By the end of this book, you'll possess the skills to design scalable, efficient, and reliable data processing solutions with Polars.
Das E-Book können Sie in Legimi-Apps oder einer beliebigen App lesen, die das folgende Format unterstützen:
Seitenzahl: 271
Veröffentlichungsjahr: 2024
Polars Cookbook
Over 60 practical recipes to transform, manipulate, and analyze your data using Python Polars 1.x
Yuki Kakegawa
Copyright © 2024 Packt Publishing
All rights reserved. No part of this book may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, without the prior written permission of the publisher, except in the case of brief quotations embedded in critical articles or reviews.
Every effort has been made in the preparation of this book to ensure the accuracy of the information presented. However, the information contained in this book is sold without warranty, either express or implied. Neither the author, nor Packt Publishing or its dealers and distributors, will be held liable for any damages caused or alleged to have been caused directly or indirectly by this book.
Packt Publishing has endeavored to provide trademark information about all of the companies and products mentioned in this book by the appropriate use of capitals. However, Packt Publishing cannot guarantee the accuracy of this information.
Group Product Manager:Apeksha Shetty
Publishing Product Manager:Deepesh Patel
Book Project Manager: Farheen Fathima and Urvi Sharma
Senior Editor:Nazia Shaikh
Technical Editor: Kavyashree K S
Copy Editor: Safis Editing
Proofreader:Nazia Shaikh
Indexer:Pratik Shirodkar
Production Designers: Aparna Bhagat, Shankar Kalbhor, and Prafulla Nikalje
Senior DevRel Marketing Coordinator:Nivedita Singh
First published: August 2024
Production reference: 2020924
Published by Packt Publishing Ltd.
Grosvenor House
11 St Paul’s Square
Birmingham
B3 1RB, UK
ISBN 978-1-80512-115-2
www.packtpub.com
First and foremost, I’m forever grateful for my wife, who encouraged me to go on this endeavor and supported me throughout the process. Without her support and sacrifice, I couldn’t have written this book, let alone build my career.
Second, a big thanks to the Packt team, who ensured the quality and timeline of the book.
Third, I’d like to thank the author of Polars, Ritchie Vink, and other contributors who made Polars come to life and continue to develop it.
Finally, I’d like to express my gratitude to you, the readers. Thank you for reading my book.
– Yuki Kakegawa
"Came for the speed, stayed for the syntax"
That's a common refrain among Polars enthusiasts. Indeed, the Polars API is truly beautiful: not only does it make for very readable code, but it also allows you to express complex aggregations that just aren't expressible with the pandas API.
Yuki has been a long-time fan of Polars. He has professional experience as a consultant. It's great to see him pair these together to produce a cookbook of practical recipes that you can use to solve real problems.
When should you use Polars? I think the best time is when you're starting a new project. Porting pandas code to Polars is certainly possible, but it's not necessarily easy. If you try thinking in Polars at the start of a new project, you'll likely surprise yourself with how expressive its API truly is, you'll use it idiomatically, and you'll make full use of its amazing features.
I'm sure you'll love learning about Polars whilst reading this book. And when you start your next data science project - please join the Polars Discord to say hello! Would love to hear about your experience!
Marco Gorelli
Polars and Pandas Contributor | Senior Software Engineer, Quansight
Yuki Kakegawa is a data analytics professional with a background in computer science. Yuki has worked in the data space for the past several years, most of which has been spent in consulting, focusing on data engineering, analytics, and business intelligence. His clients are from various industries, such as healthcare, education, insurance, and private equity. He has worked with various companies, including Microsoft and Stanford Health Care, to name a couple.
He also runs Orem Data, a data analytics consultancy that helps companies improve their existing data and analytics infrastructure.
Aside from work, Yuki enjoys playing baseball and softball with his wife and friends.
Mihai Gurău is an analytics and data professional with over eight years of experience, focusing on the “why?” behind analytics to drive meaningful action. In the airline industry, he has helped build bespoke revenue management decision support tools. His process mining implementation work effectively melded analytics with enterprise IT systems for process discovery and improvement. Nowadays, he contributes to fine-tuning product analytics and building robust data platform components for map-making and connected products and services. Beyond his professional pursuits, Mihai enjoys watersports and tries to keep abreast of relevant advancements in data and analytics engineering.