FREE Data Science Resources — A Medium Table of Contents for my Data Science Articles
What is CRISP DM?
For Data Mining process, we usually use CRISP DM data mining process:
Based on: https://www.datascience-pm.com/crisp-dm-2/
Data Mining process steps includes Business Understanding, Data Understanding, Data Preparation, Modeling, Evaluation, Deployment.
Business Understanding step – we need to understand the business and establish the question we need to answer for the data mining
Data Understanding step – we need to understand the data. We can use statistics such as descriptive, regression analysis to understand the data.
Data Preparation step – it is the cleaning of the data and we can remove duplicates here.
Modeling step – we create clustering models, prediction models, classification models.
Evaluation step – we evaluate which models is more accurate and select.
Deployment steps – we can create data products.
For Data Science, at the Deployment steps, we create data products for businesses. We can create softwares that predicts something. In the table, the goal is to build data products for a business.
For Data Mining, at the Deployment steps, we create reports or PowerPoints slides on our results. In the table, the goal is to extracting important information.
Read More on CRISP DM: http://edatascience.great-site.net/2023/03/08/what-is-crisp-dm/
Questions You may ask when you First Step into Data Science.
The following are some questions you may want to ask:
What are they? Data Science, Data Mining, Data Analysis, Big Data….
https://gohminghui88.medium.com/what-are-they-data-science-data-mining-text-mining-data-analysis-big-data-dc3f7db46f8
What is CRISP DM?
https://gohminghui88.medium.com/what-is-crisp-dm-d30416733019
What is Data Analysis Process?
https://gohminghui88.medium.com/what-is-data-analysis-process-84864779eb5
How to start a career in Data Science? Data Scientist, Data Analyst, Data Engineer, Machine Learning Engineer. What are they?
https://gohminghui88.medium.com/how-to-start-a-career-in-data-science-a7ccca6c075
Attributes, Variables, Features, Columns, Observations, Rows, Dependable Variables, Independent Variables. What are they?
https://gohminghui88.medium.com/attributes-variables-features-columns-observations-rows-dependent-variables-independent-82bf82ca195e
Categorical Variables and Numerical Variables. What are they?
https://gohminghui88.medium.com/categorical-variables-and-numerical-variables-what-are-they-d1f2cfe3b02e
Modeling and Evaluation: Explain Regression or Prediction and Classification using Simple Linear Regression using y = mx +c.
https://gohminghui88.medium.com/modeling-explain-using-simple-linear-regression-y-mx-c-many-people-explain-prediction-166a4c4f68b4
What is the difference between prediction, classification, clustering?
https://gohminghui88.medium.com/what-is-the-difference-between-prediction-classification-clustering-6946cd63cab2
Data Understanding and Data Analysis. What’s the Difference?
https://gohminghui88.medium.com/data-understanding-and-data-analysis-whats-the-difference-6ab2bc0cc96f
What are they? Dplyr, ggplot2, caret, RMarkDown… R Libraries
https://gohminghui88.medium.com/what-are-they-dplyr-ggplot2-caret-rmarkdown-r-libraries-84e3b00718f3
What are they? Pandas, MatPlotLib, Scikit Learn, Jupyter Notebook… Python Libraries.
https://gohminghui88.medium.com/what-are-they-pandas-matplotlib-scikit-learn-jupyter-notebook-python-libraries-6e75f9703436
Top 10 Data Science Tools
http://edatascience.great-site.net/2023/03/29/top-10-data-science-tools/
8 Programming Languages for Data Science
Data Certificates that is Low Cost
10 Data Science Books for Beginner
https://gohminghui88.medium.com/10-data-science-book-for-beginner-34d343f7d745
10 Data Science Certificate for Beginner
https://gohminghui88.medium.com/10-data-science-certificate-for-beginner-d88ad60089ba
R Programming Tutorial
R Programming Basics in 24 Hours
https://gohminghui88.medium.com/r-programming-basics-in-24-hours-162753d884d8
Data Preparation: Data Transformation and Data Preprocessing in R
https://gohminghui88.medium.com/data-transformation-and-data-preprocessing-in-r-b4672aa57acc
Data Understanding: Descriptive Statistics in R
https://gohminghui88.medium.com/data-understanding-descriptive-statistics-in-r-7c31fe8e8816
Data Understanding: Inferential Statistics in R
https://gohminghui88.medium.com/data-understanding-inferential-statistics-in-r-e2ab8bca5772
Data Understanding: Correlation and Regression Analysis
https://gohminghui88.medium.com/data-understanding-correlation-and-regression-analysis-4907550c0302
Data Understanding: Data Visualization in R
https://gohminghui88.medium.com/data-understanding-data-visualization-in-r-ab66b49fcdaf
Modeling and Evaluation: KNN in R
https://gohminghui88.medium.com/modeling-and-evaluation-knn-in-r-5239a27326c6
Modeling and Evaluation: Linear Regression in R
https://gohminghui88.medium.com/modeling-and-evaluation-linear-regression-in-r-0ad0f34eb210
Modeling: Neural Network in R
https://gohminghui88.medium.com/modeling-neural-network-in-r-196352f8058d
Modeling: Naive Bayes in R
https://gohminghui88.medium.com/modeling-naive-bayes-in-r-cf4ceb6914df
Deployment: RMarkdown in R
https://gohminghui88.medium.com/deployment-rmarkdown-in-r-6addad342ce3
If you want to learn more about data science, you can go to http://svbook.great-site.net/?i=1
SVBook Pte. Ltd. assists people to know What is Data Science, Text Analysis, Text Mining, Text Analytics, AI and Machine Learning, process of data, Data Mining to Insights Process using CRISP DM. Data Mining Process Steps includes Business Understanding, Data Understanding, Data Preparation, Modeling, Evaluation, Deployment.
DSTK.Tech (https://dstk2.sourceforge.io/) creates tools and technologies for Data Science, develops open source tools for data science.
EMHAcademy (http://emhacademy.great-site.net/) offers courses to help people become Certified Data Scientist.
Thanks for readings.
Kind Regards,
Eric Goh
Buy me a cup of coffee:https://www.buymeacoffee.com/gohminghui