Crisp Dm User Guide

User Manual: Pdf

Open the PDF directly: View PDF PDF.
Page Count: 14

DownloadCrisp-dm User Guide
Open PDF In BrowserView PDF
The CRISP-DM User Guide
Brussels SIG Meeting
Pete Chapman
NCR Systems Engineering Copenhagen
email: Pete.Chapman@Copenhagen.NCR.com

1

Agenda
■ CRISP-DM

Objectives and Benefits

■ CRISP-DM

Deliverables

■ CRISP-DM

Methodology, Phases and Tasks

■ CRISP-DM

User Guide

■ Possible

CRISP-DM Futures

2

Objectives and Benefits of CRISP-DM
◆

ensure quality of knowledge discovery project results

◆

reduce skills required for knowledge discovery

◆

reduce costs and time

◆

general purpose (i.e., stable across varying applications)

◆

robust (i.e., insensitive to changes in the environment)

◆

tool and technique independent

◆

tool supportable

◆

support documentation of projects

◆

capture experience for reuse

◆

support knowledge transfer and training

3

CRISP-DM Deliverables
◆

Process Model
◆
◆
◆
◆

◆

Tool Support
◆
◆

◆

Methodology
Reference Model
User Guide
Output (Deliverable/Templates)
Tool Support Definitions
Stream Library

Experimentation
◆
◆

Experimentation Reports
CRISP-DM SIG User Feedback

4

CRISP-DM Methodology
Phases
CRISP
Process Model
Generic Tasks

Mapping
Specialized Tasks

Process Instances

CRISP
Process

5

Data Mining Contexts
Generic Tasks

Application Domains

Problem Types

Technical Aspects

Tools and Techniques

•
•
•

•
•
•
•
•

•
•
•

•
•
•
•

Response Modeling
Churn Prediction
...

Data Description / Summarization
Segmentation
Concept Description
Predictive Modeling
Dependency Analysis

Missing Values
Outliers
...

Clementine
MineSet
Decision Trees
...

Specialized Tasks

6

CRISP-DM Phases
Business
Understanding

Data
Understanding

Data
Preparation
Deployment

Data
Data
Data
Modelling

Evaluation

7

Phases and Tasks
Business
Understanding

Determine
Business Objectives
Background
Business Objectives
Business Success
Criteria

Data
Understanding

Collect Initial Data
Initial Data Collection
Report

Data
Preparation

Modeling

Data Set
Data Set Description

Select Modeling
Technique
Modeling Technique
Modeling Assumptions

Evaluation

Evaluate Results
Assessment of Data
Mining Results w.r.t.
Select Data
Business Success
Describe Data
Rationale for Inclusion /
Criteria
Data Description Report
Exclusion
Generate Test Design Approved Models
Test Design
Situation Assessment Explore Data
Clean Data
Review Process
Inventory of Resources Data Exploration Report Data Cleaning Report
Review of Process
Build Model
Requirements,
Parameter Settings
Assumptions, and
Verify Data Quality
Construct Data
Models
Determine Next Steps
Constraints
Data Quality Report
Derived Attributes
List of Possible Actions
Model Description
Risks and Contingencies
Generated Records
Decision
Terminology
Assess Model
Costs and Benefits
Integrate Data
Model Assessment
Merged Data
Revised Parameter
Determine
Settings
Data Mining Goal
Format Data
Data Mining Goals
Reformatted Data
Data Mining Success
Criteria

Deployment

Plan Deployment
Deployment Plan
Plan Monitoring and
Maintenance
Monitoring and
Maintenance Plan
Produce Final Report
Final Report
Final Presentation
Review Project
Experience
Documentation

Produce Project Plan
Project Plan
Initial Asessment of
Tools and Techniques

8

Introduction to the User Guide
Reference Model
What To Do?
Generic Tasks

Context

User Guide
How To Do?
• check lists
• questionaires
• tools
• sequences of steps
• decision points
• pitfalls

Specialized Tasks

9

CRISP-DM User Guide

10

How to use the User Guide (i)
◆

Contents of the User Guide
- More detailed description of the various tasks using:
◆
◆
◆
◆

◆

Activities List
Check Lists
Good Ideas
Warnings!

What is NOT in the User Guide
◆
◆
◆
◆

Deliverables/Document Templates (as yet)
Description of Techniques and Tools (as yet)
Estimates of engagements
Quality Indicators

11

How to use the User Guide (ii)
◆

Beginning Data Miners
◆
◆
◆
◆
◆

◆

What tasks do I need to do?
What is the order of the tasks in a Data Mining Engagement?
What risks do I run?
Are there any “shortcuts” in my tasks?
What are the format of the deliverables that I need to resent to management?

Experienced Data Miners
◆
◆
◆
◆

Have I missed any activity?
Are there any tasks or activity that I can leave until later?
How can I make a Project Plan?
How can I document the project for later re-use?

12

Possible Future CRISP-DM Deliverables

◆

“CRISP-DM - The Book ”, includes
◆
◆
◆
◆
◆
◆
◆

Experiences, feedback from SIG members
Reference Model, User Guide updated with experiments
Full Deliverables/Document Templates
Case Studies
Mapping Advice from Generic to Specific Engagements
More explicit advice on Tools & Techniques
Advice on documentation of engagements,
establishment of Data Mining Library,…..

13

14



Source Exif Data:
File Type                       : PDF
File Type Extension             : pdf
MIME Type                       : application/pdf
PDF Version                     : 1.2
Linearized                      : No
Page Count                      : 14
Create Date                     : 1999:04:13 10:37:20
Producer                        : Acrobat Distiller 3.02
EXIF Metadata provided by EXIF.tools

Navigation menu