IBM Cloud Pak for Data: An enterprise platform to operationalize data, analytics, and AI

by Deepak Rangarao, Hemanth Manda, Sriram Srinivasan

Length: 336 pages
Edition: 1
Language: English
Publisher: Packt Publishing
Publication Date: 2021-11-24
ISBN-10: 1800562128
ISBN-13: 9781800562127
Sales Rank: #1435856 (See Top 100 Books)

Build end-to-end AI solutions with IBM Cloud Pak for Data to operationalize AI on a secure platform based on cloud-native reliability, cost-effective multitenancy, and efficient resource management

Key Features

Explore data virtualization by accessing data in real time without moving it
Unify the data and AI experience with the integrated end-to-end platform
Explore the AI life cycle and learn to build, experiment, and operationalize trusted AI at scale

Book Description

Cloud Pak for Data is IBM’s modern data and AI platform that includes strategic offerings from its data and AI portfolio delivered in a cloud-native fashion with the flexibility of deployment on any cloud. The platform offers a unique approach to addressing modern challenges with an integrated mix of proprietary, open-source, and third-party services.

You’ll begin by getting to grips with key concepts in modern data management and artificial intelligence (AI), reviewing real-life use cases, and developing an appreciation of the AI Ladder principle. Once you’ve gotten to grips with the basics, you will explore how Cloud Pak for Data helps in the elegant implementation of the AI Ladder practice to collect, organize, analyze, and infuse data and trustworthy AI across your business. As you advance, you’ll discover the capabilities of the platform and extension services, including how they are packaged and priced. With the help of examples present throughout the book, you will gain a deep understanding of the platform, from its rich capabilities and technical architecture to its ecosystem and key go-to-market aspects.

By the end of this IBM book, you’ll be able to apply IBM Cloud Pak for Data’s prescriptive practices and leverage its capabilities to build a trusted data foundation and accelerate AI adoption in your enterprise.

What you will learn

Understand the importance of digital transformations and the role of data and AI platforms
Get to grips with data architecture and its relevance in driving AI adoption using IBM’s AI Ladder
Understand Cloud Pak for Data, its value proposition, capabilities, and unique differentiators
Delve into the pricing, packaging, key use cases, and competitors of Cloud Pak for Data
Use the Cloud Pak for Data ecosystem with premium IBM and third-party services
Discover IBM’s vibrant ecosystem of proprietary, open-source, and third-party offerings from over 35 ISVs

Who this book is for

This book is for data scientists, data stewards, developers, and data-focused business executives interested in learning about IBM’s Cloud Pak for Data. Knowledge of technical concepts related to data science and familiarity with data analytics and AI initiatives at various levels of maturity are required to make the most of this book.

The AI Ladder: IBM’s Prescriptive Approach
Cloud Pak for Data: A Brief Introduction
Collect – Making Data Simple and Accessible
Organize – Creating a Trusted Analytics Foundation
Analyzing: Building, Deploying, and Scaling Models with Trust and Transparency
Multi-Cloud Strategy and Cloud Satellite
IBM and Partner Extension Services
Customer Use Cases
Technical Overview, Management, and Administration
Security and Compliance
Storage
Multi-Tenancy

Cover
Title Page
Copyright and Credits
Contributors
About the reviewers
Table of Contents
Preface
Section 1: The Basics
Chapter 1: The AI Ladder – IBM's Prescriptive Approach
	Market dynamics and IBM's Data and AI portfolio
	Introduction to the AI ladder
		The rungs of the AI ladder
	Collect – making data simple and accessible
	Organize – creating a trusted analytics foundation
		People empowering your data citizens
	Analyze – building and scaling models with trust and transparency
	Infuse – operationalizing AI throughout the business
		Customer service
		Risk and compliance
		IT operations
		Financial operations
		Business operations
		The case for a data and AI platform
	Summary
Chapter 2: Cloud Pak for Data: A Brief Introduction
	The case of a data and AI platform – recap
	Overview of Cloud Pak for Data
	Exploring unique differentiators, key use cases, and customer adoption
		Key use cases
		Customer use case: AI claim processing
		Customer use case: data and AI platform
	Cloud Pak for Data: additional details
		An open ecosystem
		Premium IBM cartridges and third-party services
		Industry accelerators
		Packaging and deployment options
	Red Hat OpenShift
	Summary
Section 2: Product Capabilities
Chapter 3: Collect – Making Data Simple and Accessible
	Data – the world's most valuable asset
		Data-centric enterprises
	Challenges with data-centric delivery
	Enterprise data architecture
		NoSQL data stores – key categories
	Data virtualization – accessing data anywhere
	Data virtualization versus ETL – when to use what?
	Platform connections – streamlining data connectivity
	Data estate modernization using Cloud Pak for Data
	Summary
Chapter 4: Organize – Creating a Trusted Analytics Foundation
	Introducing Data Operations (DataOps)
	Organizing enterprise information assets
	Establishing metadata and stewardship
		Business metadata components
		Technical metadata components
	Profiling to get a better understanding of your data
	Classifying data for completeness
		Automating data discovery and business term assignment
	Enabling trust with data quality
		Steps to assess data quality
		DataOps in action
		Automation rules around data quality
	Data privacy and activity monitoring
	Data integration at scale
		Considerations for selecting a data integration tool
		The extract, transform, and load (ETL) service in Cloud Pak for Data
		Advantages of leveraging a cloud-native platform for ETL
	Master data management
	Extending MDM toward a Digital Twin
	Summary
Chapter 5: Analyzing: Building, Deploying, and Scaling Models with Trust and Transparency
	Self-service analytics of governed data
	BI and reporting
	Predictive versus prescriptive analytics
	Understanding AI
	AI life cycle – Transforming insights into action
	AI governance: Trust and transparency
	Automating the AI life cycle using Cloud Pak for Data
		Data science tools for a diverse data science team
		Distributed AI
		Establishing a collaborative environment and building AI models
		Choosing the right tools to use
		ModelOps – Deployment phase
		ModelOps – Monitoring phase
		Streaming data/analytics
		Distributed processing
	Summary
Chapter 6: Multi-Cloud Strategy and Cloud Satellite
	IBM's multi-cloud strategy
	Supported deployment options
		Managed OpenShift
		AWS Quick Start
		Azure Marketplace and quickstart templates
	Cloud Pak for Data as a Service
		Packaging and pricing
	IBM Cloud Satellite
	A data fabric for a multi-cloud future
	Summary
Chapter 7: IBM and Partner Extension Services
	IBM and third-party extension services
	Collect extension services
		Db2 Advanced
		Informix
		Virtual Data Pipeline
		EDB Postgres Advanced Server
		MongoDB Enterprise Advanced
	Organize extension services
		DataStage
		Information Server
		Master Data Management
		Analyze cartridges – IBM Palantir
	Infuse cartridges
		Cognos Analytics
		Planning Analytics
		Watson Assistant
		Watson Discovery
		Watson API Kit
	Modernization upgrades to Cloud Pak for Data cartridges
		Extension services
	Summary
Chapter 8: Customer Use Cases
	Improving health advocacy program efficiency
	Voice-enabled chatbots
	Risk and control automation
	Enhanced border security
	Unified Data Fabric
	Financial planning and analytics
	Summary
Section 3: Technical Details
Chapter 9: Technical Overview, Management, and Administration
	Technical requirements
	Architecture overview
		Characteristics of the platform
		Technical underpinnings
		The operator pattern
		The platform technical stack
	Infrastructure requirements, storage, and networking
		Understanding how storage is used
		Networking
	Foundational services and the control plane
		Cloud Pak foundational services
		Cloud Pak for Data control plane
		Management and monitoring
	Multi-tenancy, resource management, and security
		Isolation using namespaces
		Resource management and quotas
		Enabling tenant self-management
	Day 2 operations
		Upgrades
		Scale-out
		Backup and restore
	Summary
	References
Chapter 10: Security and Compliance
	Technical requirements
	Security and Privacy by Design
		Development practices
		Vulnerability detection
		Delivering security assured container images
	Secure operations in a shared environment
		Securing Kubernetes hosts
		Security in OpenShift Container Platform
		Namespace scoping and service account privileges
		RBAC and the least privilege principle
		Workload notification and reliability assurance
		Additional considerations
		Encryption in motion and securing entry points
		Encryption at rest
		Anti-virus software
	User access and authorizations
		Authentication
		Authorization
		User management and groups
		Securing credentials
	Meeting compliance requirements
		Configuring the operating environment for compliance
		Auditing
		Integration with IBM Security Guardium
	Summary
	References
Chapter 11: Storage
	Understanding the concept of persistent volumes
		Kubernetes storage introduction
		Types of persistent volumes
		In-cluster storage
		Optimized hyperconverged storage and compute
		Separated compute and storage Nodes
		Provisioning procedure summary
	Off-cluster storage
		NFS-based persistent volumes
	Operational considerations
		Continuous availability with in-cluster storage
		Data protection – snapshots, backups, and active-passive disaster recovery
		Quiescing Cloud Pak for Data services
		Db2 database backups and HADR
		Kubernetes cluster backup and restore
	Summary
	Further reading
Chapter 12: Multi-Tenancy
	Tenancy considerations
		Designating tenants
		Organizational and operational implications
	Architecting for multi-tenancy
		Achieving tenancy with namespace scoping
		Ensuring separation of duties with Kubernetes RBAC and separation of duties with operators
		Securing access to a tenant instance
		Choosing dedicated versus shared compute nodes
	Reviewing the tenancy requirements
		Isolating tenants
		Tenant security and compliance
		Self-service and management
		A summary of the assessment
	In-namespace sub-tenancy with looser isolation
		Approach
		Assessing the limitations of this approach
	Summary
Other Books You May Enjoy
Index

AI & Machine Learning Artificial Intelligence Cloud Computing Data Modeling & Design Enterprise Applications Intelligence & Semantics

Donate to keep this site alive

To access the Link, solve the captcha.

How to download source code?

1. Go to: https://github.com/PacktPublishing

2. In the Find a repository… box, search the book title: IBM Cloud Pak for Data: An enterprise platform to operationalize data, analytics, and AI, sometime you may not get the results, please search the main title.

IBM Cloud Pak for Data: An enterprise platform to operationalize data, analytics, and AI

Key Features

Book Description

What you will learn

Who this book is for

Table of Contents

How to download source code?

Deep Learning Tools for Predicting Stock Market Movements

Principles of Data Science, 3rd Edition: A beginner's guide to essential math and coding skills for data fluency and machine learning

Machine Learning Infrastructure and Best Practices for Software Engineers: Take your machine learning software from a prototype to a fully fledged software system

Active Machine Learning with Python: Refine and elevate data quality over quantity with active learning

Kubernetes Secrets Handbook: Design, implement, and maintain production-grade Kubernetes Secrets management solutions

Data Labeling in Machine Learning with Python: Explore modern ways to prepare labeled data for training and fine-tuning ML and generative AI models