Learn Before
Concept
Scrapy Spider Scraping Cycle
In general, the scraping cycle for spiders follows this basic pattern:
- Initial Requests are generated along with the function that should be called on those Requests (callback function)
- The callback function parses responses and returns item objects, Request objects, or an iterable of either aforementioned objects
- In the callback functions, page contents are parsed to generate data
- Items returned by the functions are added to a database or written to a file
0
1
Updated 2021-06-15
Tags
Python Programming Language
Data Science