const crawler = new PuppeteerCrawler({ requestList, useSessionPool: true, persistCookiesPerSession: true, proxyConfiguration, requestHandler: router, requestHandlerTimeoutSecs: 100, headless: false, minConcurrency: 20, maxConcurrency: 30, launchContext: { launcher: PuppeteerExtra, useIncognitoPages: true }, })
useIncognitoPages: true
, only one session is used concurrently with one proxy. Is this how it should work? What is the point of having a session pool if only one is used?useIncognitoPages: true
. That way each page has its own proxy and everything is handled.useSessionPool
? With the below config Puppeteer is running on the same proxy.const crawler = new PuppeteerCrawler({ // useSessionPool: true, requestHandler: router, maxConcurrency: 2, headless: false, proxyConfiguration, requestList, }) await crawler.run()
useSessionPool: false, persistCookiesPerSession: false
otherwise I not sure how exactly it will end up with some other session settings and incognito pages, may be SDK will enforce cookies, may be not, never tried this way actually π To check in more details you can add some log output based on context.session