Data Analysis and Machine Learning with Kaggle: How to win competitions on Kaggle and build a successful career in data science
- Length: 384 pages
- Edition: 1
- Language: English
- Publisher: Packt Publishing
- Publication Date: 2021-11-09
- ISBN-10: 1801817472
- ISBN-13: 9781801817479
- Sales Rank: #0 (See Top 100 Books)
Get a step ahead of your competitors with a concise collection of smart data handling and modeling techniques
Key Features
- Learn how Kaggle works and how to make the most of competitions from two expert Kagglers
- Sharpen your modeling skills with ensembling, feature engineering, adversarial validation, AutoML, transfer learning, and techniques for parameter tuning
- Discover tips, tricks, and best practices for winning on Kaggle and becoming a better data scientist
Book Description
Millions of data enthusiasts from around the world compete on Kaggle, the most famous data science competition platform of them all. Participating in Kaggle competitions is a surefire way to improve your data analysis skills, network with the rest of the community, and gain valuable experience to help grow your career.
The first book of its kind, Data Analysis and Machine Learning with Kaggle assembles the techniques and skills you’ll need for success in competitions, data science projects, and beyond. Two masters of Kaggle walk you through modeling strategies you won’t easily find elsewhere, and the tacit knowledge they’ve accumulated along the way. As well as Kaggle-specific tips, you’ll learn more general techniques for approaching tasks based on image data, tabular data, textual data, and reinforcement learning. You’ll design better validation schemes and work more comfortably with different evaluation metrics.
Whether you want to climb the ranks of Kaggle, build some more data science skills, or improve the accuracy of your existing models, this book is for you.
What you will learn
- Get acquainted with Kaggle and other competition platforms
- Make the most of Kaggle Notebooks, Datasets, and Discussion forums
- Understand different modeling tasks including binary and multi-class classification, object detection, NLP (Natural Language Processing), and time series
- Design good validation schemes, learning about k-fold, probabilistic, and adversarial validation
- Get to grips with evaluation metrics including MSE and its variants, precision and recall, IoU, mean average precision at k, as well as never-before-seen metrics
- Handle simulation and optimization competitions on Kaggle
- Create a portfolio of projects and ideas to get further in your career
Who This Book Is For
This book is suitable for Kaggle users and data analysts/scientists of all experience levels who are trying to do better in Kaggle competitions and secure jobs with tech giants.
Table of Contents
- Introducing Data Science competitions
- Organizing Data with Datasets
- Working and learning with kaggle notebooks
- Leveraging Discussion forums
- Detailing competition tasks and metrics
- Designing good validation schemes
- Ensembling and stacking solutions
- Modelling for tabular competitions
- Modeling for image classification and segmentation
- Modeling for Natural Language Processing
- Handling simulation and optimization competitions
- Creating your portfolio of projects and ideas
- Finding new professional opportunities
B17574_03 Data Analysis and Machine Learning with Kaggle: How to win competitions on Kaggle and build a successful career in data science Introducing data science competitions The rise of data science competition platforms Kaggle competition platform Other competition platforms Stages of a competition Types of competitions and examples Submission and leaderboard dynamics Computational resources Teaming and networking Performance tiers and rankings Criticism and opportunities Organizing Data with Datasets Setting up a dataset Gathering the data Using the Kaggle datasets outside of Kaggle Building around datasets Legal caveats Working and Learning with Kaggle Notebooks Setting up a kernel Upgrade to GCP One step beyond Kaggle courses
How to download source code?
1. Go to: https://github.com/PacktPublishing
2. In the Find a repository… box, search the book title: Data Analysis and Machine Learning with Kaggle: How to win competitions on Kaggle and build a successful career in data science
, sometime you may not get the results, please search the main title.
3. Click the book title in the search results.
3. Click Code to download.
1. Disable the AdBlock plugin. Otherwise, you may not get any links.
2. Solve the CAPTCHA.
3. Click download link.
4. Lead to download server to download.