Releases: apify/crawlee
Releases · apify/crawlee
v3.9.2
v3.9.1
v3.9.0
3.9.0 (2024-04-10)
Bug Fixes
- include actual key in error message of KVS'
setValue
(#2411) (9089bf1) - notify autoscaled pool about newly added requests (#2400) (a90177d)
- puppeteer: allow passing
networkidle
towaitUntil
ingotoExtended
(#2399) (5d0030d), closes #2398 - sitemaps support
application/xml
(#2408) (cbcf47a)
Features
v3.8.2
3.8.2 (2024-03-21)
Bug Fixes
- core: solve possible dead locks in
RequestQueueV2
(#2376) (ffba095) - correctly report gzip decompression errors (#2368) (84a2f17)
- puppeteer: improve detection of older versions (98d4e86), closes #2370
- use 0 (number) instead of false as default for sessionRotationCount (#2372) (667a3e7)
Features
v3.8.1
v3.8.0
3.8.0 (2024-02-21)
Bug Fixes
createRequests
works correctly withexclude
(and nothing else) (#2321) (048db09)- puppeteer: add 'process' to the browser bound methods (#2329) (2750ba6)
- puppeteer: replace
page.waitForTimeout()
withsleep()
(52d7219), closes #2335 - puppeteer: support
puppeteer@v22
(#2337) (3cc360a)
Features
KeyValueStore.recordExists()
(#2339) (8507a65)- accessing crawler state, key-value store and named datasets via crawling context (#2283) (58dd5fc)
- adaptive playwright crawler (#2316) (8e4218a)
- add Sitemap.tryCommonNames to check well known sitemap locations (#2311) (85589f1), closes #2307
- core: add
userAgent
parameter toRobotsFile.isAllowed()
+RobotsFile.from()
helper (#2338) (343c159) - Support plain-text sitemap files (sitemap.txt) (#2315) (0bee7da)
v3.7.3
v3.7.2
v3.7.1
v3.7.0
3.7.0 (2023-12-21)
Bug Fixes
retryOnBlocked
doesn't override the blocked HTTP codes (#2243) (81672c3)- browser-pool: respect user options before assigning fingerpints (#2190) (f050776), closes #2164
- filter out empty globs (#2205) (41322ab), closes #2200
- make CLI work on Windows too with
--no-purge
(#2244) (83f3179) - make SessionPool queue up getSession calls to prevent overruns (#2239) (0f5665c), closes #1667
- MemoryStorage: lock request JSON file when reading to support multiple process crawling (#2215) (eb84ce9)