Web Scraping in Python using Beautiful Soup | Writing a Python program to Scrape IMDB website

This video explains how to perform web scraping in Python using the Beautiful Soup and Requests modules. I will write a Python program that scrapes the IMDB website and loads the desired data into an Excel file. I will write the program from scratch so you can follow along.

Web scraping is the process of programmatically extracting data from a website. In Python, we can easily write a program to scrape a website using the Requests and Beautiful Soup modules. The Requests module is used to access the desired website, and the Beautiful Soup module is used to parse the HTML source code that comes back. Beautiful Soup provides several methods for extracting data from HTML tags. Web pages are written in HTML, so to perform web scraping our program needs to read HTML content. This is where Beautiful Soup comes into the picture: it gives access to the contents of the HTML through a few simple methods.
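As a minimal sketch of how the two modules divide the work: Requests downloads the page and Beautiful Soup parses it. The function names and the sample HTML below are made up for illustration and are not taken from the video. The parsing step runs on an inline string so it can be tried without network access.

```python
import requests
from bs4 import BeautifulSoup


def fetch_html(url: str) -> str:
    """Download a page and return its HTML source as text."""
    response = requests.get(url, timeout=10)
    response.raise_for_status()  # raise an exception on 4xx/5xx responses
    return response.text


def extract_heading_and_links(html: str):
    """Parse HTML and return the first <h1> text plus every link target."""
    soup = BeautifulSoup(html, "html.parser")
    heading = soup.find("h1").get_text(strip=True)
    links = [a["href"] for a in soup.find_all("a", href=True)]
    return heading, links


# Hypothetical sample page, used instead of a live request.
SAMPLE_HTML = """
<html><body>
  <h1>Sample page</h1>
  <a href="/first">First</a>
  <a href="/second">Second</a>
</body></html>
"""

heading, links = extract_heading_and_links(SAMPLE_HTML)
print(heading, links)
```

For a real site, the string returned by `fetch_html(url)` would be passed to the parsing function in place of `SAMPLE_HTML`.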

You don’t need to be an expert in HTML to learn web scraping in Python. You only need the basics: what a tag is and how to identify the attributes associated with a tag.
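To make "tag" and "attribute" concrete, here is a small sketch using Beautiful Soup on a single made-up link element. The class name and URL are invented for the example.

```python
from bs4 import BeautifulSoup

# One HTML element: the tag is <a>, and it carries two attributes.
html = '<a class="title-link" href="/movie/1">Example Movie</a>'
soup = BeautifulSoup(html, "html.parser")

tag = soup.find("a")
tag_name = tag.name          # the tag name, here "a"
href = tag["href"]           # value of the href attribute
classes = tag["class"]       # class can hold several values, so it is a list
text = tag.get_text()        # the text between the opening and closing tag

print(tag_name, href, classes, text)
```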

To explain web scraping with BeautifulSoup in Python, we will write a Python program that accesses the IMDB website, fetches its top rated movies, and loads this data into an Excel file.
The IMDB website contains ratings for many movies, but in this program we are only interested in scraping the top rated movies of all time.
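The extraction step could look roughly like the sketch below. The HTML structure, class names, titles, years and ratings here are placeholders invented for illustration. IMDB's actual markup is different and changes over time, so the tag and class names would need to be adapted after inspecting the live page.

```python
from bs4 import BeautifulSoup

# Hypothetical markup for illustration only; not IMDB's real HTML.
SAMPLE_HTML = """
<table>
  <tbody class="movie-list">
    <tr>
      <td class="title"><a href="/title/a">Example Film A</a> <span class="year">(2001)</span></td>
      <td class="rating"><strong>9.0</strong></td>
    </tr>
    <tr>
      <td class="title"><a href="/title/b">Example Film B</a> <span class="year">(2002)</span></td>
      <td class="rating"><strong>8.9</strong></td>
    </tr>
  </tbody>
</table>
"""


def parse_top_movies(html):
    """Return one dict per table row with rank, title, year and rating."""
    soup = BeautifulSoup(html, "html.parser")
    rows = soup.find("tbody", class_="movie-list").find_all("tr")
    movies = []
    for rank, row in enumerate(rows, start=1):
        title = row.find("td", class_="title").a.get_text(strip=True)
        # The year is wrapped in parentheses, e.g. "(2001)".
        year = row.find("span", class_="year").get_text(strip=True).strip("()")
        rating = row.find("td", class_="rating").strong.get_text(strip=True)
        movies.append(
            {"rank": rank, "title": title, "year": int(year), "rating": float(rating)}
        )
    return movies


movies = parse_top_movies(SAMPLE_HTML)
for movie in movies:
    print(movie)
```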

To load data into an Excel file, I will use the openpyxl module. With openpyxl it is easy to create a new Excel file, rename its sheet, and load data into it. I won’t explain openpyxl in detail; I will cover just enough to create an Excel file and load data into it.
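A minimal sketch of those three openpyxl steps (create a workbook, rename the sheet, append rows) is shown here. The sheet name, column headers, file name and row values are assumptions chosen for the example, not the ones used in the video.

```python
import os
import tempfile

from openpyxl import Workbook, load_workbook


def save_movies(rows, path):
    """Write the rows to a new Excel file with a header row on top."""
    workbook = Workbook()                 # a new workbook starts with one sheet
    sheet = workbook.active
    sheet.title = "Top Rated Movies"      # rename the default sheet
    sheet.append(["Rank", "Movie Name", "Year", "IMDB Rating"])
    for row in rows:
        sheet.append(row)                 # each call adds one row at the bottom
    workbook.save(path)


# Placeholder data standing in for the scraped results.
rows = [(1, "Example Film A", 2001, 9.0), (2, "Example Film B", 2002, 8.9)]
path = os.path.join(tempfile.gettempdir(), "top_movies_example.xlsx")
save_movies(rows, path)

# Read the file back to confirm what was written.
reloaded = load_workbook(path)
first_data_row = [cell.value for cell in reloaded["Top Rated Movies"][2]]
print(reloaded.sheetnames, first_data_row)
```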

We will install the Requests and Beautiful Soup modules using pip. I am using a Mac, so the command to run from the terminal is “pip3 install module_name”. If you are using Windows, run “pip install module_name” from the command prompt.
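After installing, a quick check like the sketch below can confirm that the modules are importable. It uses only the standard library. Note that the pip package name and the import name differ for Beautiful Soup: the package is `beautifulsoup4` and the module is `bs4`. The helper name `missing_packages` is made up for this example.

```python
import importlib.util

# pip package name -> module name used in an import statement
REQUIRED = {
    "requests": "requests",
    "beautifulsoup4": "bs4",
    "openpyxl": "openpyxl",
}


def missing_packages(required):
    """Return the pip package names whose modules cannot be found."""
    return [
        package
        for package, module in required.items()
        if importlib.util.find_spec(module) is None
    ]


missing = missing_packages(REQUIRED)
if missing:
    # On Windows the command is usually "pip" rather than "pip3".
    print("Install with: pip3 install " + " ".join(missing))
else:
    print("All required modules are importable.")
```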

🔴 WATCH MORE VIDEOS HERE 👇

✅ SQL Tutorial - Basic concepts:
   • SQL Tutorial - Basic concepts  

✅ SQL Tutorial - Intermediate concepts:
   • SQL Tutorial - Intermediate concepts  

✅ SQL Tutorial - Advance concepts:
   • SQL Tutorial - Advance concepts  

✅ Practice Solving Basic SQL Queries:
   • Practice Solving BASIC SQL Queries  

✅ Practice Solving Intermediate SQL Queries:
   • Practice Solving INTERMEDIATE SQL Que...  

✅ Practice Solving Complex SQL Queries:
   • Practice Solving COMPLEX SQL Queries  

✅ Data Analytics Career guidance:
   • Data Analytics career and more  

✅ SQL Course, SQL Training Platform Recommendations:
   • SQL Course / Training  

✅ Python Tutorial:
   • Python Tutorial  

✅ Git and GitHub Tutorial:
   • Git and GitHub  

✅ Data Analytics Projects:
   • Data Analytics Projects  

THANK YOU,
Thoufiq
