Exploratory Data Analysis with PySpark using Diabetes Dataset

Описание к видео Exploratory Data Analysis with PySpark using Diabetes Dataset

Exploratory Data Analysis refers to the critical process of performing initial investigations on data so as to discover patterns ,to spot anomalies, to test hypothesis and to check assumptions with the help of summary statistics and graphical representations.

The datasets consists of several medical predictor variables and one target variable, Outcome. Predictor variables includes the number of pregnancies the patient has had, their BMI, insulin level, age, and so on.

Github: https://github.com/markumreed/data_sc...
p5.js Collection: https://editor.p5js.org/markumreed/co...
LinkedIn:   / data-science-for-everyone  

Комментарии

Информация по комментариям в разработке