Attributes, Variables, features, Columns, Observations, Rows, Dependent Variables, Independent…

Member-only story

4 min readMar 21, 2023

Attributes, Variables, features, Columns, Observations, Rows, Dependent Variables, Independent Variables. What are they?

When you first go into data science, you will hear terminologies or words like Attributes, Variables, Features, Columns, Observations, Rows, Dependent Variables, Independent Variables… What are they? They are confusing.

For Data Mining process, we usually use CRISP DM data mining process:

Based on: https://www.datascience-pm.com/crisp-dm-2/

Data Mining process steps includes Business Understanding, Data Understanding, Data Preparation, Modeling, Evaluation, Deployment.

Business Understanding step – we need to understand the business and establish the question we need to answer for the data mining

Data Understanding step – we need to understand the data. We can use statistics such as descriptive, regression analysis to understand the data.

Data Preparation step – it is the cleaning of the data and we can remove duplicates here.

Modeling step – we create clustering models, prediction models, classification models.

Written by Eric Goh Ming Hui

No responses yet