Queue Priority; Clear Queues; Always On Support
Pre-release
Pre-release
- Added ability to clear crawl queues by RequestId and Age, see
Krawler#removeUrlsByRootPage
andKrawler#removeUrlsByAge
- Added config option to prevent crawler shutdown on empty queues
- Added new single byte priority field to
KrawlQueueEntry
. Queues will always attempt to pop thelowest
priority
entry available. Priority can be assigned by overriding theKrawler#assignQueuePriorty
method. - Update dependencies