Build ML Pipelines using SparkML in PySpark | Python | Google Colab

Описание к видео Build ML Pipelines using SparkML in PySpark | Python | Google Colab

In this video, I will show you how to do build Machine Learning pipelines in PySpark using SparkML on Google Colab. Below are the contents of this video:

1. Preprocessing data using SparkML
2. Modeling using SparkML
3. Prediction on Test data
4. Building ML pipelines

Notes: Transformer will call only transform() method and the resulting data frame will be passed to next stage. For Estimator, it will call fit() method, which returns a model and then transform() method will be called to create the output data frame.

Link to the previous video on "Modeling in PySpark using Spark ML":    • Modeling in PySpark using Spark ML on...  

Link to the playlist "Getting started with PySpark" :    • Getting started with PySpark  

Link to "Setting up the PySpark environment on Google Colab" video:    • Setting up the PySpark environment on...  

Link to the GitHub repo: https://github.com/Abhishekmamidi123/...

Check out my "Data Science guide for freshers and enthusiasts" playlist:    • My path to becoming a Data Scientist ...  
I have put my 3 years of learning experience into this playlist.

Please do like, share and subscribe to this channel and share this video with your friends. Keep learning :)

Follow me here:
LinkedIn:   / abhishekmamidi  
Blog: https://www.abhishekmamidi.com/
GitHub: http://github.com/Abhishekmamidi123
Kaggle: http://www.kaggle.com/abhishekmamidi

Tags:
abhishek mamidi, data science, machine learning, deep learning, artificial intelligence, internship, career, college, job, experience, krish naik, ai engineering, fresher, data science enthusiasts, pyspark, apache spark, python, pysparkling

Комментарии

Информация по комментариям в разработке