Practical Machine Learning for Computer Vision: End-to-End Machine Learning for Images

by Martin Görner, Ryan Gillard, Valliappa Lakshmanan

Length: 350 pages
Edition: 1
Language: English
Publisher: O'Reilly Media
Publication Date: 2021-11-16
ISBN-10: 1098102363
ISBN-13: 9781098102364
Sales Rank: #0 (See Top 100 Books)

By using machine learning models to extract information from images, organizations today are making breakthroughs in healthcare, manufacturing, retail, and other industries. This practical book shows ML engineers and data scientists how to solve a variety of image problems including classification, object detection, autoencoders, image generation, counting, and captioning with proven ML techniques.

Google engineers Valliappa Lakshmanan, Martin Garner, and Ryan Gillard show you how to develop accurate and explainable computer vision ML models and put them into large-scale production using robust ML architecture in a flexible and maintainable way. You’ll learn how to design, train, evaluate, and predict with models written in TensorFlow/Keras. This book also covers best practices to improve the operationalization of the models using end-to-end ML pipelines.

You’ll learn how to:

Design ML architecture for computer vision tasks
Select a model (such as ResNet, SqueezeNet, or EfficientNet) appropriate to your task
Create an end-to-end ML pipeline to train, evaluate, deploy, and explain your model
Preprocess images for data augmentation and to support learnability
Incorporate explainability and responsible AI best practices
Deploy image models as web services or on edge devices
Monitor and manage ML models

Preface
    Who Is This Book For?
    How to Use This Book
    Organization of the Book
    Conventions Used in This Book
    Using Code Examples
    O’Reilly Online Learning
    How to Contact Us
    Acknowledgments
1. Machine Learning for Computer Vision
    Machine Learning
    Deep Learning Use Cases
    Summary
2. ML Models for Vision
    A Dataset for Machine Perception
        5-Flowers Dataset
        Reading Image Data
        Visualizing Image Data
        Reading the Dataset File
    A Linear Model Using Keras
        Keras Model
        Training the Model
    A Neural Network Using Keras
        Neural Networks
        Deep Neural Networks
    Summary
    Glossary
3. Image Vision
    Pretrained Embeddings
        Pretrained Model
        Transfer Learning
        Fine-Tuning
    Convolutional Networks
        Convolutional Filters
        Stacking Convolutional Layers
        Pooling Layers
        AlexNet
    The Quest for Depth
        Filter Factorization
        1x1 Convolutions
        VGG19
        Global Average Pooling
    Modular Architectures
        Inception
        SqueezeNet
        ResNet and Skip Connections
        DenseNet
        Depth-Separable Convolutions
        Xception
    Neural Architecture Search Designs
        NASNet
        The MobileNet Family
    Beyond Convolution: The Transformer Architecture
    Choosing a Model
        Performance Comparison
        Ensembling
        Recommended Strategy
    Summary
4. Object Detection and Image Segmentation
    Object Detection
        YOLO
        RetinaNet
    Segmentation
        Mask R-CNN and Instance Segmentation
        U-Net and Semantic Segmentation
    Summary
5. Creating Vision Datasets
    Collecting Images
        Photographs
        Imaging
        Proof of Concept
    Data Types
        Channels
        Geospatial Data
        Audio and Video
    Manual Labeling
        Multilabel
        Object Detection
    Labeling at Scale
        Labeling User Interface
        Multiple Tasks
        Voting and Crowdsourcing
        Labeling Services
    Automated Labeling
        Labels from Related Data
        Noisy Student
        Self-Supervised Learning
    Bias
        Sources of Bias
        Selection Bias
        Measurement Bias
        Confirmation Bias
        Detecting Bias
    Creating a Dataset
        Splitting Data
        TensorFlow Records
        Reading TensorFlow Records
    Summary
6. Preprocessing
    Reasons for Preprocessing
        Shape Transformation
        Data Quality Transformation
        Improving Model Quality
    Size and Resolution
        Using Keras Preprocessing Layers
        Using the TensorFlow Image Module
        Mixing Keras and TensorFlow
        Model Training
    Training-Serving Skew
        Reusing Functions
        Preprocessing Within the Model
        Using tf.transform
    Data Augmentation
        Spatial Transformations
        Color Distortion
        Information Dropping
    Forming Input Images
    Summary
7. Training Pipeline
    Efficient Ingestion
        Storing Data Efficiently
        Reading Data in Parallel
        Maximizing GPU Utilization
    Saving Model State
        Exporting the Model
        Checkpointing
    Distribution Strategy
        Choosing a Strategy
        Creating the Strategy
    Serverless ML
        Creating a Python Package
        Submitting a Training Job
        Hyperparameter Tuning
        Deploying the Model
    Summary
8. Model Quality and Continuous Evaluation
    Monitoring
        TensorBoard
        Weight Histograms
        Device Placement
        Data Visualization
        Training Events
    Model Quality Metrics
        Metrics for Classification
        Metrics for Regression
        Metrics for Object Detection
    Quality Evaluation
        Sliced Evaluations
        Fairness Monitoring
        Continuous Evaluation
    Summary
9. Model Predictions
    Making Predictions
        Exporting the Model
        Using In-Memory Models
        Improving Abstraction
        Improving Efficiency
    Online Prediction
        TensorFlow Serving
        Modifying the Serving Function
        Handling Image Bytes
    Batch and Stream Prediction
        The Apache Beam Pipeline
        Managed Service for Batch Prediction
        Invoking Online Prediction
    Edge ML
        Constraints and Optimizations
        TensorFlow Lite
        Running TensorFlow Lite
        Processing the Image Buffer
        Federated Learning
    Summary
10. Trends in Production ML
    Machine Learning Pipelines
        The Need for Pipelines
        Kubeflow Pipelines Cluster
        Containerizing the Codebase
        Writing a Component
        Connecting Components
        Automating a Run
    Explainability
        Techniques
        Adding Explainability
    No-Code Computer Vision
        Why Use No-Code?
        Loading Data
        Training
        Evaluation
    Summary
11. Advanced Vision Problems
    Object Measurement
        Reference Object
        Segmentation
        Rotation Correction
        Ratio and Measurements
    Counting
        Density Estimation
        Extracting Patches
        Simulating Input Images
        Regression
        Prediction
    Pose Estimation
        PersonLab
        The PoseNet Model
        Identifying Multiple Poses
    Image Search
        Distributed Search
        Fast Search
        Better Embeddings
    Summary
12. Image and Text Generation
    Image Understanding
        Embeddings
        Auxiliary Learning Tasks
        Autoencoders
        Variational Autoencoders
    Image Generation
        Generative Adversarial Networks
        GAN Improvements
        Image-to-Image Translation
        Super-Resolution
        Modifying Pictures (Inpainting)
        Anomaly Detection
        Deepfakes
    Image Captioning
        Dataset
        Tokenizing the Captions
        Batching
        Captioning Model
        Training Loop
        Prediction
    Summary
Afterword
Index

AI & Machine Learning Artificial Intelligence Computer Vision & Pattern Recognition Data Processing Intelligence & Semantics Neural Networks

Donate to keep this site alive

To access the Link, solve the captcha.

How to download source code?

1. Go to: https://www.oreilly.com/

2. Search the book title: Practical Machine Learning for Computer Vision: End-to-End Machine Learning for Images, sometime you may not get the results, please search the main title

3. Click the book title in the search results

3. Publisher resources section, click Download Example Code.

1. Disable the AdBlock plugin. Otherwise, you may not get any links.

2. Solve the CAPTCHA.

3. Click download link.

4. Lead to download server to download.