This notebook performs web scraping with BeautifulSoup and Selenium. It takes a website URL as input and extracts the following information from that webpage:
- Extracting specific HTML tags, including heading tags h1 to h6, along with titles and meta descriptions
- Extracting image ALT attributes (alt text)
- Counting words inside a web page
- Checking for broken links inside a webpage
- Extracting the source code of the webpage in Google Colab
- Extracting all URLs from a website without duplication
- Measuring the frontend and backend performance of a website
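
As a rough illustration of several items in the list above (title, meta description, h1 to h6 headings, ALT text, word count, and de-duplicated URLs), here is a minimal sketch. It is not the notebook's own code: to stay runnable without extra packages it uses Python's standard-library `html.parser` instead of BeautifulSoup, and it parses an inline HTML string instead of fetching a live URL. The names `PageSummary`, `summarize`, `SAMPLE_HTML`, and the `https://example.com/` base URL are hypothetical, chosen only for this example. Broken-link checks and performance measurement need network access and are left out.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin

HEADINGS = {"h1", "h2", "h3", "h4", "h5", "h6"}
SKIP = {"script", "style"}  # text inside these is not visible page text


class PageSummary(HTMLParser):
    """Hypothetical helper that collects basic on-page information."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.title = ""
        self.meta_description = None
        self.headings = []   # list of (tag, text) pairs
        self.alts = []       # alt text of <img> elements
        self.links = []      # absolute URLs, unique, in page order
        self.word_count = 0  # words in visible text (title excluded)
        self._stack = []     # open tags that change how text is handled

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "meta" and (attrs.get("name") or "").lower() == "description":
            self.meta_description = attrs.get("content")
        elif tag == "img" and attrs.get("alt") is not None:
            self.alts.append(attrs["alt"])
        elif tag == "a" and attrs.get("href"):
            url = urljoin(self.base_url, attrs["href"])  # resolve relative links
            if url not in self.links:                    # skip duplicates
                self.links.append(url)
        if tag in HEADINGS:
            self.headings.append((tag, ""))
        if tag in HEADINGS or tag in SKIP or tag == "title":
            self._stack.append(tag)

    def handle_endtag(self, tag):
        if self._stack and self._stack[-1] == tag:
            self._stack.pop()
            if tag in HEADINGS:
                # Collapse whitespace once the heading is complete.
                name, text = self.headings[-1]
                self.headings[-1] = (name, " ".join(text.split()))

    def handle_data(self, data):
        current = self._stack[-1] if self._stack else None
        if current in SKIP:
            return
        if current == "title":
            self.title += data.strip()
            return
        if current in HEADINGS:
            name, text = self.headings[-1]
            self.headings[-1] = (name, text + data)
        self.word_count += len(data.split())


def summarize(html, base_url):
    """Parse an HTML string and return the populated PageSummary."""
    parser = PageSummary(base_url)
    parser.feed(html)
    parser.close()
    return parser


# Inline sample page (an assumption for this sketch, not real site data).
SAMPLE_HTML = """
<html><head>
<title>Demo page</title>
<meta name="description" content="A small test page">
<style>p { color: red; }</style>
</head><body>
<h1>Main heading</h1>
<p>Three short words</p>
<h2>Sub <em>heading</em></h2>
<img src="a.png" alt="Logo">
<a href="/about">About</a>
<a href="/about">About again</a>
<a href="https://example.org/x">External</a>
</body></html>
"""

if __name__ == "__main__":
    page = summarize(SAMPLE_HTML, "https://example.com/")
    print(page.title, page.meta_description)
    print(page.headings, page.alts)
    print(page.links, page.word_count)
```

In a real run, the HTML string would come from a fetched page (for example, the page source returned by Selenium) rather than from the inline sample.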