Machine Learning Cookbook with Python: Create ML and Data Analytics Projects Using Some Amazing Open Datasets
- Length: 264 pages
- Edition: 1
- Language: English
- Publisher: BPB Publications
- Publication Date: 2020-11-11
- ISBN-10: 9389898005
- ISBN-13: 9789389898002
- Sales Rank: #3915544 (See Top 100 Books)
A Cookbook that will help you implement Machine Learning algorithms and techniques bybuilding real-world projects
Key Features
- Learn how to handle an entire Machine Learning Pipeline supported with adequate mathematics.
- Create Predictive Models and choose the right model for various types of Datasets.
- Learn the art of tuning a model to improve accuracy as per Business requirements.
- Get familiar with concepts related to Data Analytics with Visualization, Data Science and Machine Learning.
Description
Machine Learning does not have to be intimidating at all. This book focuses on the concepts of Machine Learning and Data Analytics with mathematical explanations and programming examples. All the codes are written in Python as it is one of the most popular programming languages used for Data Science and Machine Learning. Here I have leveraged multiple libraries like NumPy, Pandas, scikit-learn, etc. to ease our task and not reinvent the wheel. There are five projects in total, each addressing a unique problem. With the recipes in this cookbook, one will learn how to solve Machine Learning problems for real-time data and perform Data Analysis and Analytics, Classification, and beyond. The datasets used are also unique and will help one to think, understand the problem and proceed towards the goal. The book is not saturated with Mathematics, but mostly all the Mathematical concepts are covered for the important topics. Every chapter typically starts with some theory and prerequisites, and then it gradually dives into the implementation of the same concept using Python, keeping a project in the background.
What will you learn
- Understand the working of the O.S.E.M.N. framework in Data Science.
- Get familiar with the end-to-end implementation of Machine Learning Pipeline.
- Learn how to implement Machine Learning algorithms and concepts using Python.
- Learn how to build a Predictive Model for a Business case.
Who this book is for
This cookbook is meant for anybody who is passionate enough to get into the World of Machine Learning and has a preliminary understanding of the Basics of Linear Algebra, Calculus, Probability, and Statistics. This book also serves as a reference guidebook for intermediate Machine Learning practitioners.
Cover Page Title Page Copyright Page Dedication Page About the Author Acknowledgement Preface Errata Table of Contents 1. Boston Crime Introduction Structure Objective What is Data? Types of Data Let’s talk about the Boston dataset Data Dictionary O.S.E.M.N. framework What is Data Obtaining? What is Data Scrubbing? Finding Data Types How to Handle Missing Data? What to Do with Duplicate Values? What is Data Exploring? Univariate Feature Analysis What is Statistical Distribution? Multivariate Feature Analysis Further reading Conclusion 2. World Happiness Report Introduction Structure Objective Let’s Talk about the Dataset Pre-requisites Data Exploration Different Types of Data Data represented on Nominal (Categorical) Scale Data represented on Ordinal Scale Measurement of Data Discrete Variables Continuous Variables Interval Scale Circular Scale Ratio Scale Distributions Normal Distribution Distribution Plot Kernel Density Estimation Box Plot Skewed Distribution Left Skewed (Negative Skewness) Distribution Plot Kernel Density Estimation Box Plot Right Skewed (Positive Skewness) Distribution Plot Kernel Density Estimation Box Plot Other Distributions Further Reading Conclusion 3. Iris Species Introduction Structure Objective Introduction to Machine Learning What is Machine Learning? Types of Machine Learning Supervised Learning Unsupervised Learning Reinforcement Learning Machine Learning Pipeline Understanding the dataset Exploratory Data Analysis Load Dataset Scikit-learn Seaborn Data Analysis Pair Plot Correlation How to Choose ML Algorithms? Classification K - nearest Neighbors (K-nn) How Does It Work? Model Training Evaluation Model Tuning Clustering K-means How Does It Work? Model Training Evaluation & Model Tuning Regression Linear Regression How Does It Work? Model Training Evaluation Further reading Conclusion 4. Credit Card Fraud Detection Introduction Structure Objective Let’s Understand the Data What is an Imbalanced Dataset? Knowing the Features Prerequisites Normalization/Feature Scaling Min-max Feature Scaling (Rescaling) Mean Normalization Standardization (Z-score Normalization) Principal Component Analysis Cross-validation Data Analysis Scaling Standard Scaler Robust Scaling Power Transformer Quantile Transformer Splitting Dataset Handling Imbalance Random Undersampling Random Oversampling Tomek’s Links for Undersampling Synthetic Minority Oversampling Technique Data Reanalysis Positive Feature (V1..., V28) Negative Feature (V1.. ., V28) Scaled Features Modeling Machine Learning Algorithms Logistic Regression Decision Tree Support Vector Machine (SVM) Model Training Data Preparation Metric Trap Training & Evaluation Further reading Conclusion 5. Heart Disease UCI Introduction Structure Objective Prerequisites Why is ML in Medicine So Critical? What is Explainable AI? Regularization Hypothesis Testing Ensemble Learning Bagging Boosting Let’s Understand the Data Data Analysis Correlation “ca” “oldpeak” “thalach” “cp” Splitting Dataset Machine Learning Modeling Machine Learning Algorithms Naive Bayes Classifier Random Forest Model Training Training & Basic Evaluation Naive Bayes Ensemble Method (Bagging) Ensemble Method (Boosting) Random Forest Feature Selection & Grid Searching Feature Selection Pipeline Grid Searching Explainable AI Feature Importance Model Visualization Partial Dependence Plot SHAP (SHapley Additive exPlanations) Further reading Conclusion
Donate to keep this site alive
1. Disable the AdBlock plugin. Otherwise, you may not get any links.
2. Solve the CAPTCHA.
3. Click download link.
4. Lead to download server to download.