Install BeautifulSoup and lxml ,
Open terminal and write
pip install beautifulsoup4 pip install lxml
The scraper is written in Python and uses lxml to parse web pages. BeautifulSoup is used to extract data from HTML and XML files.
Film-related emotions
There are 8 classes of emotions that would be efficient for classifying text ... These are: Anger, Expectation, Disgust, Fear, Joy, Sad, Surprise, Trust . Here they are taken as input and the corresponding movies will be displayed for emotion.
The correspondence of each emotion to the genre of cinema is listed below:
Garden — Drama
Disgust — Musical
Anger — Family
Waiting — Thriller
Fear — Sports
Enjoyment — Thriller
Trust — Western
Surprise — Film Noir
Based on emotions, the appropriate genre will be selected, and all 5 best films of this genre will be recommended to the user.
|
This script will scrape all movie titles in the genre corresponding to the entered emotion and make a list for the user.
Web Scraping is very useful for extracting data and analyzing it. Without Web Cleaning, the Internet, as you know, would not really exist. This is because Google and other major search engines use a sophisticated web scraper to extract content to be included in their index. These tools make search engines possible.
Application crawling
- Article retrieval for sites that curate content.
- Business list retrieval for companies that create databases of leads.
- Many different types of data extraction, sometimes referred to as data extraction. For example, one of the popular and sometimes controversial uses of the web scraper — This is a reduction in airline prices for publication on airfare comparison sites.