Minor improvements:
- Added a new optional variable
MAX_QUOTA
which allows you to control the maximum number of crawled pages per process instance, instead of per crawl. - Added a new "recently completed" pages to the stats of the crawl.
- Added a new
seo.h2s
analysis to theseo
library, which collects all h2 headers in a page.