Member-only story

Eric Goh Ming Hui
6 min readMar 21, 2023

--

What is the difference between prediction, classification, clustering?

CRISP DM
What is CRISP DM? For Data Mining process, we usually use CRISP DM data mining process:

Based on: https://www.datascience-pm.com/crisp-dm-2/

Data Mining process steps includes Business Understanding, Data Understanding, Data Preparation, Modeling, Evaluation, Deployment.

Business Understanding step – we need to understand the business and establish the question we need to answer for the data mining

Data Understanding step – we need to understand the data. We can use statistics such as descriptive, regression analysis to understand the data.

Data Preparation step – it is the cleaning of the data and we can remove duplicates here.

Modeling step – we create clustering models, prediction models, classification models.

Evaluation step – we evaluate which models is more accurate and select.

Deployment steps – we can create data products.

For Data Science, at the Deployment steps, we create data products for businesses. We can create softwares that predicts something. In the…

--

--

Eric Goh Ming Hui
Eric Goh Ming Hui

Written by Eric Goh Ming Hui

(G.Dip, M.Tech, eMBA) | Author of "Learn R for Applied Statistics" | Founder of SVBook Pte. Ltd. : http://svbook.great-site.net

No responses yet