Modern Web Scraping with Python using Scrapy Splash Selenium
$0.99

Web Scraping nowadays has become one of the hottest topics, there are plenty of paid tools out there in the market that don’t show you anything how things are done as you will be always limited to their functionalities as a consumer.
In this course you won’t be a consumer anymore, i’ll teach you how you can build your own scraping tool ( spider ) using Scrapy.
What you’ll learn
- Understand the fundamentals of Web Scraping
- Scrape websites using Scrapy
- Understand Xpath & CSS Selectors
- Build a complete Spider from A to Z
- Store the extracted Data in MongoDb & SQLite3
- Scrape JavaScript websites using Splash & Selenium
- Build a CrawlSpider
- Understand the Crawling behavior
- Build a custom Middleware
- Web Scraping best practices
- Avoid getting banned while scraping websites
- Bypass cloudflare
- Scrape APIs
- Scrape infinite scroll websites
- Working with Cookies
- Deploy spiders locally and to the cloud
- Run spiders periodically
- Prevent storing duplicated data
- Build datasets
- Login to websites using Scrapy
- Download images and files using Scrapy
Course content
1818 sections • 128 lectures • 8h 50m total length
Introduction
Scrapy Fundamentals
XPath expressions & CSS Selectors
Project 1 Spiders from A to Z
Building Datasets
Project 2 Dealing with Multiple pages
Debugging spiders
Let's take a break !
Project 3 Build Crawlers using Scrapy
Splash crash course
Project 4 Scraping JavaScript websites using Splash
Project 5 Scraping JavaScript websites using Selenium
Working with Pipelines
Scraping APIs (NEW)
Log in to websites (NEW)
Project 6 Bypass Cloudflare
APPENDIX (OLDER SCRAPY 1.5 CONTENT)
BONUS