29,99 €
The Statistics and Machine Learning with R Workshop is a comprehensive resource packed with insights into statistics and machine learning, along with a deep dive into R libraries. The learning experience is further enhanced by practical examples and hands-on exercises that provide explanations of key concepts.
Starting with the fundamentals, you’ll explore the complete model development process, covering everything from data pre-processing to model development. In addition to machine learning, you’ll also delve into R's statistical capabilities, learning to manipulate various data types and tackle complex mathematical challenges from algebra and calculus to probability and Bayesian statistics. You’ll discover linear regression techniques and more advanced statistical methodologies to hone your skills and advance your career.
By the end of this book, you'll have a robust foundational understanding of statistics and machine learning. You’ll also be proficient in using R's extensive libraries for tasks such as data processing and model training and be well-equipped to leverage the full potential of R in your future projects.
Das E-Book können Sie in Legimi-Apps oder einer beliebigen App lesen, die das folgende Format unterstützen:
Seitenzahl: 626
Veröffentlichungsjahr: 2023
The Statistics and Machine Learning with R Workshop
Unlock the power of efficient data science modeling with this hands-on guide
Liu Peng
Copyright © 2023 Packt Publishing
All rights reserved. No part of this book may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, without the prior written permission of the publisher, except in the case of brief quotations embedded in critical articles or reviews.
Every effort has been made in the preparation of this book to ensure the accuracy of the information presented. However, the information contained in this book is sold without warranty, either express or implied. Neither the author, nor Packt Publishing or its dealers and distributors, will be held liable for any damages caused or alleged to have been caused directly or indirectly by this book.
Packt Publishing has endeavored to provide trademark information about all of the companies and products mentioned in this book by the appropriate use of capitals. However, Packt Publishing cannot guarantee the accuracy of this information.
Group Product Manager: Ali Abidi
Book Project Manager: Farheen Fathima
Senior Editor: Nazia Shaikh
Technical Editor: Devanshi Ayare
Copy Editor: Safis Editing
Proofreader: Safis Editing
Indexer: Tejal Daruwale Soni
Production Designer: Joshua Misquitta
DevRel Marketing Coordinator: Vinshika Kalra
First published: September 2023
Production reference: 1290923
Published by Packt Publishing Ltd
Grosvenor House
11 St Paul’s Square
Birmingham
B3 1RB
ISBN 978-1-80324-030-5
www.packtpub.com
This book is dedicated to my family, particularly my wife, Zheng, and my children, Jiaran, Jiaxin, and Jiayu. Jiaran comes first this time, as her older sister (Jiaxin) already declared victory in my other book.
Liu Peng is an assistant professor of quantitative finance (practice) at Singapore Management University and an adjunct researcher at the National University of Singapore. He holds a Ph.D. in statistics from the National University of Singapore and has 10 years of working experience as a data scientist across the banking, technology, and hospitality industries.
This volume encapsulates a decade-long odyssey through the multifaceted landscape of data science, a journey that began as a spark of personal curiosity and evolved into an integrated blend of theoretical and practical insights. I owe a debt of gratitude to my esteemed mentors—Teo Chung Piaw, Chen Ying, and Ian Wilson—who have been instrumental in shaping my academic and professional trajectory, providing unwavering support every step of the way.
Usha Rengaraju currently heads the data science research at Exa Protocol, and she is the first female triple Kaggle Grandmaster worldwide. She specializes in deep learning and probabilistic graphical models and was also one of the judges of the TigerGraph Graph for All Million Dollar Challenge. She was ranked as one of the top 10 data scientists in India by Analytics India Magazine and also ranked as one of the top 150 AI leaders and influencers by 3AI magazine. She is one of the winners of the ML in Action competition organized by the ML developer programs team at Google, and her team won first place at the WiDS Datathon 2022 organized by Stanford University. She was also the winner of the Kaggle ML Research Spotlight for 2022 and the winner of the TensorFlow Community Spotlight 2023.
Vybhavreddy KC is a dedicated data science practitioner by profession. He has fortified his passion for data with a Bachelor's degree in Computer Science and a Master's degree in Analytics. Vybhav's expertise includes leading the development of innovative ML/AI driven solutions for Compliance and Regulatory product suite. When he's not immersed in the realm of numbers and algorithms, Vybhav cherishes his free time, and loves playing with his children.
I would like to thank my wife Srilakshmi, my lovely kids Varshil and Reyansh for their unwavering support in achieving my academic and professional goals.
This part is designed to equip you with knowledge of statistical and programming fundamentals, focusing particularly on the versatile R language, which will serve as the cornerstone for more advanced topics in subsequent parts.
By the end of this part, you’ll have a strong grasp of the core statistical and programming concepts essential for any data science practitioner to understand. With these foundational skills in hand, you’ll be well prepared to delve into the more specialized topics that await you in subsequent parts of this book.
This part has the following chapters:
Chapter 1, Getting Started with RChapter 2, Data Processing with dplyrChapter 3, Intermediate Data ProcessingChapter 4, Data Visualization with ggplot2Chapter 5, Exploratory Data AnalysisChapter 6, Effective Reporting with R Markdown