Better main loop (in Scrapy?) #8
Labels
enhancement
New feature or request
help wanted
Extra attention is needed
performance
Performance bottlenecks
Currently the main loop runs in a shell script, a file of URLs is consumed by a loop that runs Scrapy once per URL.
A better approach, which I failed to get working, would be to run scrape once and manage the loop and consumption of URLs in there. This should increase performance/speed considerably.
The text was updated successfully, but these errors were encountered: