Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Better main loop (in Scrapy?) #8

Open
gingerbeardman opened this issue Jan 6, 2020 · 0 comments
Open

Better main loop (in Scrapy?) #8

gingerbeardman opened this issue Jan 6, 2020 · 0 comments
Labels
enhancement New feature or request help wanted Extra attention is needed performance Performance bottlenecks

Comments

@gingerbeardman
Copy link
Owner

Currently the main loop runs in a shell script, a file of URLs is consumed by a loop that runs Scrapy once per URL.

A better approach, which I failed to get working, would be to run scrape once and manage the loop and consumption of URLs in there. This should increase performance/speed considerably.

@gingerbeardman gingerbeardman added enhancement New feature or request help wanted Extra attention is needed labels Jan 6, 2020
@gingerbeardman gingerbeardman changed the title Better main loop Better main loop (in Scrapy?) Jan 6, 2020
@gingerbeardman gingerbeardman added the performance Performance bottlenecks label Jan 6, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request help wanted Extra attention is needed performance Performance bottlenecks
Projects
None yet
Development

No branches or pull requests

1 participant