Crisp Dm User Guide
User Manual: Pdf
Open the PDF directly: View PDF .
Page Count: 14
Download | ![]() |
Open PDF In Browser | View PDF |
The CRISP-DM User Guide Brussels SIG Meeting Pete Chapman NCR Systems Engineering Copenhagen email: Pete.Chapman@Copenhagen.NCR.com 1 Agenda ■ CRISP-DM Objectives and Benefits ■ CRISP-DM Deliverables ■ CRISP-DM Methodology, Phases and Tasks ■ CRISP-DM User Guide ■ Possible CRISP-DM Futures 2 Objectives and Benefits of CRISP-DM ◆ ensure quality of knowledge discovery project results ◆ reduce skills required for knowledge discovery ◆ reduce costs and time ◆ general purpose (i.e., stable across varying applications) ◆ robust (i.e., insensitive to changes in the environment) ◆ tool and technique independent ◆ tool supportable ◆ support documentation of projects ◆ capture experience for reuse ◆ support knowledge transfer and training 3 CRISP-DM Deliverables ◆ Process Model ◆ ◆ ◆ ◆ ◆ Tool Support ◆ ◆ ◆ Methodology Reference Model User Guide Output (Deliverable/Templates) Tool Support Definitions Stream Library Experimentation ◆ ◆ Experimentation Reports CRISP-DM SIG User Feedback 4 CRISP-DM Methodology Phases CRISP Process Model Generic Tasks Mapping Specialized Tasks Process Instances CRISP Process 5 Data Mining Contexts Generic Tasks Application Domains Problem Types Technical Aspects Tools and Techniques • • • • • • • • • • • • • • • Response Modeling Churn Prediction ... Data Description / Summarization Segmentation Concept Description Predictive Modeling Dependency Analysis Missing Values Outliers ... Clementine MineSet Decision Trees ... Specialized Tasks 6 CRISP-DM Phases Business Understanding Data Understanding Data Preparation Deployment Data Data Data Modelling Evaluation 7 Phases and Tasks Business Understanding Determine Business Objectives Background Business Objectives Business Success Criteria Data Understanding Collect Initial Data Initial Data Collection Report Data Preparation Modeling Data Set Data Set Description Select Modeling Technique Modeling Technique Modeling Assumptions Evaluation Evaluate Results Assessment of Data Mining Results w.r.t. Select Data Business Success Describe Data Rationale for Inclusion / Criteria Data Description Report Exclusion Generate Test Design Approved Models Test Design Situation Assessment Explore Data Clean Data Review Process Inventory of Resources Data Exploration Report Data Cleaning Report Review of Process Build Model Requirements, Parameter Settings Assumptions, and Verify Data Quality Construct Data Models Determine Next Steps Constraints Data Quality Report Derived Attributes List of Possible Actions Model Description Risks and Contingencies Generated Records Decision Terminology Assess Model Costs and Benefits Integrate Data Model Assessment Merged Data Revised Parameter Determine Settings Data Mining Goal Format Data Data Mining Goals Reformatted Data Data Mining Success Criteria Deployment Plan Deployment Deployment Plan Plan Monitoring and Maintenance Monitoring and Maintenance Plan Produce Final Report Final Report Final Presentation Review Project Experience Documentation Produce Project Plan Project Plan Initial Asessment of Tools and Techniques 8 Introduction to the User Guide Reference Model What To Do? Generic Tasks Context User Guide How To Do? • check lists • questionaires • tools • sequences of steps • decision points • pitfalls Specialized Tasks 9 CRISP-DM User Guide 10 How to use the User Guide (i) ◆ Contents of the User Guide - More detailed description of the various tasks using: ◆ ◆ ◆ ◆ ◆ Activities List Check Lists Good Ideas Warnings! What is NOT in the User Guide ◆ ◆ ◆ ◆ Deliverables/Document Templates (as yet) Description of Techniques and Tools (as yet) Estimates of engagements Quality Indicators 11 How to use the User Guide (ii) ◆ Beginning Data Miners ◆ ◆ ◆ ◆ ◆ ◆ What tasks do I need to do? What is the order of the tasks in a Data Mining Engagement? What risks do I run? Are there any “shortcuts” in my tasks? What are the format of the deliverables that I need to resent to management? Experienced Data Miners ◆ ◆ ◆ ◆ Have I missed any activity? Are there any tasks or activity that I can leave until later? How can I make a Project Plan? How can I document the project for later re-use? 12 Possible Future CRISP-DM Deliverables ◆ “CRISP-DM - The Book ”, includes ◆ ◆ ◆ ◆ ◆ ◆ ◆ Experiences, feedback from SIG members Reference Model, User Guide updated with experiments Full Deliverables/Document Templates Case Studies Mapping Advice from Generic to Specific Engagements More explicit advice on Tools & Techniques Advice on documentation of engagements, establishment of Data Mining Library,….. 13 14
Source Exif Data:
File Type : PDF File Type Extension : pdf MIME Type : application/pdf PDF Version : 1.2 Linearized : No Page Count : 14 Create Date : 1999:04:13 10:37:20 Producer : Acrobat Distiller 3.02EXIF Metadata provided by EXIF.tools