Scheda corso
NovaNext Training / IBM / IBM Data and AI / IBM InfoSphere QualityStage Essentials v11.7

IBM InfoSphere QualityStage Essentials v11.7

Codice
KM214G
Durata
5 Giorni
Prezzo
4.250,00 € (iva escl.)
Lingua
Italiano
Modalità
Virtual Classroom
       

 

Schedulazione
Luogo Data Iscrizione
A Richiesta

This course teaches how to build QualityStage parallel jobs that investigate, standardize, match, and consolidate data records.

This course covers common data quality issues, QualityStage architecture, QualityStage clients and their functions, importing metadata, running jobs and reviewing results, building Investigate jobs, the Standardize stage and rule sets, identifying matching records and applying multiple Match passes, building a Survive job, and using a Two-Source match.

Students will gain experience by building an application that combines customer data from three source systems into a single master customer record.

 

Prerequisiti

Participants should have the following skills:

  • Familiarity with the Windows Operating System
  • Familiarity with a text editor
  • Helpful, but not required: Some understanding of elementary statistics principles such as weighted averages and probabilities.

 

Obiettivi

After completing this course, learners should be able to:

  • List common data quality contaminants
  • Describe QualityStage architecture, clients, and their functions
  • Build and run DataStage and QualityStage jobs and review results
  • Use Character Discrete, Concatenate, and Word Investigations to analyze data fields
  • Build jobs using the Standardize stage
  • Build a QualityStage job to identify matching records
  • Interpret, improve, and consolidate match results

 

Destinatari
  • Data analysts responsible for data quality using QualityStage
  • Data quality architects
  • Data cleansing developers

 

Contenuti

Data Quality Issues

Exercise 1: Pre-lab Prep

QualityStage Overview

Exercise 1: QualityStage Logon

Developing with QualityStage

Exercise 1: Import Table Definition Metadata

Exercise 2: Build a QualityStage Job

Investigate

Build Investigate Jobs

Standardize

Exercise 1: Standardize Country

Exercise 2: Select US Records

Exercise 3: Standardize USPREP

Exercise 4: Standardize USNAME, USADDR, and USAREA

Exercise 5: Investigate Unhandled Patterns

Exercise 6: Apply Rule Set Overrides

Match

Exercise 1: Create match Frequency Job

Exercise 2: One-source Match Specification

Exercise 3: Build a One-source Job using Match Specification

Survive

Exercise 1: Survivorship

Exercise 2: Create Customer Load File

Two-Sort Match

Exercise 1: Read the Case Study

Exercise 2: Prepare the Data Environment

Exercise 3: Run the Two-Source Match Job