I am scraping a main page that has a list of items. Within my pipeline I connect to a database to store the items. My next task is to go to each individual item page and scrape comments. I need to connect to the database again to see if I've already scraped the comments.
Is it more efficient for me connect to the database in the pipeline or in the crawl script?
Is there a way to return from the pipeline and tell the crawler to crawl the comments?