Better main loop (in Scrapy?) #8

gingerbeardman · 2020-01-06T11:15:46Z

Currently the main loop runs in a shell script, a file of URLs is consumed by a loop that runs Scrapy once per URL.

A better approach, which I failed to get working, would be to run scrape once and manage the loop and consumption of URLs in there. This should increase performance/speed considerably.

gingerbeardman added enhancement New feature or request help wanted Extra attention is needed labels Jan 6, 2020

gingerbeardman changed the title ~~Better main loop~~ Better main loop (in Scrapy?) Jan 6, 2020

gingerbeardman added the performance Performance bottlenecks label Jan 6, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Better main loop (in Scrapy?) #8

Better main loop (in Scrapy?) #8

gingerbeardman commented Jan 6, 2020

Better main loop (in Scrapy?) #8

Better main loop (in Scrapy?) #8

Comments

gingerbeardman commented Jan 6, 2020