Scrapy Spider Scraping Cycle

In general, the scraping cycle for spiders follows this basic pattern:

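  • The spider generates the initial Requests for the first URLs to crawl, each paired with a callback function that is called with the downloaded response. By default, one Request is created per URL in start_urls, with parse as the callback.
  • Each callback parses its response and returns item objects, Request objects, or an iterable of these. Any returned Requests carry their own callbacks and are downloaded and handled the same way, which is what makes this a cycle.
  • Inside the callbacks, page contents are typically parsed with Selectors (other parsers such as BeautifulSoup or lxml also work) to extract the data that populates the items.
  • Items returned from the spider are typically persisted to a database (through an Item Pipeline) or written to a file (through feed exports).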
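
The cycle above can be sketched as a small, standard-library-only simulation. This is a toy model and does not import Scrapy: the names Request, parse, and start_requests mirror Scrapy's vocabulary, but the classes, the crawl loop, the PAGES dictionary, and the example.test URLs are all hypothetical stand-ins chosen for illustration. Scrapy's real engine schedules and downloads requests asynchronously, which this sketch leaves out.

```python
import json
from collections import deque
from dataclasses import dataclass
from typing import Callable

# Hypothetical in-memory "website": URL -> (page title, outgoing links).
PAGES = {
    "http://example.test/": ("Home", ["http://example.test/a", "http://example.test/b"]),
    "http://example.test/a": ("Page A", ["http://example.test/b"]),
    "http://example.test/b": ("Page B", []),
}


@dataclass
class Request:
    """Stand-in for a request: a URL plus the callback that handles its response."""
    url: str
    callback: Callable


@dataclass
class Response:
    """Stand-in for a downloaded page."""
    url: str
    title: str
    links: list


class ToySpider:
    start_urls = ["http://example.test/"]

    def start_requests(self):
        # Step 1: generate the initial requests, each paired with a callback.
        for url in self.start_urls:
            yield Request(url, callback=self.parse)

    def parse(self, response):
        # Steps 2 and 3: parse the page, yield an item, then follow-up requests.
        yield {"url": response.url, "title": response.title}
        for link in response.links:
            yield Request(link, callback=self.parse)


def crawl(spider):
    """Run the cycle until no requests remain and return the collected items."""
    queue = deque(spider.start_requests())
    seen = set()
    items = []
    while queue:
        request = queue.popleft()
        if request.url in seen:
            continue  # skip URLs that were already fetched
        seen.add(request.url)
        title, links = PAGES[request.url]  # "download" from the fake site
        response = Response(request.url, title, links)
        for result in request.callback(response):
            if isinstance(result, Request):
                queue.append(result)  # new requests re-enter the cycle
            else:
                items.append(result)  # Step 4: hand the item to storage
    return items


items = crawl(ToySpider())
# One JSON object per line, as might be written to an output file.
output = "\n".join(json.dumps(item) for item in items)
print(output)
```

The point of the design is that the callback never distinguishes "first" pages from later ones: whatever it yields is either stored (an item) or queued with its own callback (a Request), so the same loop covers the whole crawl.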

Updated 2021-06-15

Tags

Python Programming Language

Data Science
