The Unsupervised Learning Workshop, 2nd Edition

by Aaron Jones, Benjamin Johnston, Christopher Kruger

Length: 550 pages
Edition: 2
Language: English
Publisher: Packt Publishing
Publication Date: 2020-07-29
ISBN-10: 1800200706
ISBN-13: 9781800200708
Sales Rank: #0 (See Top 100 Books)

Learning how to apply unsupervised algorithms on unlabeled datasets from scratch can be easier than you thought with this beginner’s workshop, featuring interesting examples and activities

Key Features

Get familiar with the ecosystem of unsupervised algorithms
Learn interesting methods to simplify large amounts of unorganized data
Tackle real-world challenges, such as estimating the population density of a geographical area

Book Description

Do you find it difficult to understand how popular companies like WhatsApp and Amazon find valuable insights from large amounts of unorganized data? The Unsupervised Learning Workshop will give you the confidence to deal with cluttered and unlabeled datasets, using unsupervised algorithms in an easy and interactive manner.

The book starts by introducing the most popular clustering algorithms of unsupervised learning. You’ll find out how hierarchical clustering differs from k-means, along with understanding how to apply DBSCAN to highly complex and noisy data. Moving ahead, you’ll use autoencoders for efficient data encoding.

As you progress, you’ll use t-SNE models to extract high-dimensional information into a lower dimension for better visualization, in addition to working with topic modeling for implementing natural language processing (NLP). In later chapters, you’ll find key relationships between customers and businesses using Market Basket Analysis, before going on to use Hotspot Analysis for estimating the population density of an area.

By the end of this book, you’ll be equipped with the skills you need to apply unsupervised algorithms on cluttered datasets to find useful patterns and insights.

What you will learn

Distinguish between hierarchical clustering and the k-means algorithm
Understand the process of finding clusters in data
Grasp interesting techniques to reduce the size of data
Use autoencoders to decode data
Extract text from a large collection of documents using topic modeling
Create a bag-of-words model using the CountVectorizer

Who this book is for

If you are a data scientist who is just getting started and want to learn how to implement machine learning algorithms to build predictive models, then this book is for you. To expedite the learning process, a solid understanding of the Python programming language is recommended, as you’ll be editing classes and functions instead of creating them from scratch.

Introduction to Clustering
Hierarchical Clustering
Neighborhood Approaches and DBSCAN
Dimensionality Reduction Techniques and PCA
Autoencoders
t-Distributed Stochastic Neighbor Embedding
Topic Modeling
Market Basket Analysis
Hotspot Analysis

Computers & Technology
- Computer Science
- Programming Languages

AI & Machine Learning Intelligence & Semantics Neural Networks Python

Donate to keep this site alive

To access the Link, solve the captcha.

How to download source code?

1. Go to: https://github.com/PacktPublishing

2. In the Find a repository… box, search the book title: The Unsupervised Learning Workshop, 2nd Edition, sometime you may not get the results, please search the main title.