Releases: apify/crawlee

v3.7.1

02 Jan 08:29

3.7.1 (2024-01-02)

Bug Fixes

  • ES2022 build compatibility and move to NodeNext for module (#2258) (7fe1e68), closes #2257

v3.7.0

21 Dec 16:11

3.7.0 (2023-12-21)

Bug Fixes

  • retryOnBlocked doesn't override the blocked HTTP codes (#2243) (81672c3)
  • browser-pool: respect user options before assigning fingerprints (#2190) (f050776), closes #2164
  • filter out empty globs (#2205) (41322ab), closes #2200
  • make CLI work on Windows too with --no-purge (#2244) (83f3179)
  • make SessionPool queue up getSession calls to prevent overruns (#2239) (0f5665c), closes #1667
  • MemoryStorage: lock request JSON file when reading to support multiple process crawling (#2215) (eb84ce9)
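
The empty-glob fix in the list above can be pictured with a small sketch. This is not Crawlee's code: `filterEmptyGlobs` and the `GlobInput` type are hypothetical names, and the sketch simply assumes that a glob may be given either as a string or as an object with a `glob` field. It shows blank patterns being dropped before they reach a matcher.

```typescript
// Hypothetical helper, not part of Crawlee's API: drops glob patterns that are
// empty or whitespace-only before they are handed to a matcher.
type GlobInput = string | { glob: string };

function filterEmptyGlobs(globs: GlobInput[]): GlobInput[] {
    return globs.filter((item) => {
        // Normalize both input shapes to the raw pattern string.
        const pattern = typeof item === 'string' ? item : item.glob;
        return pattern.trim().length > 0;
    });
}

const cleaned = filterEmptyGlobs([
    'https://example.com/**',
    '',
    { glob: '   ' },
    { glob: '**/docs/*' },
]);
console.log(cleaned);
```

Filtering up front keeps the matching step free of special cases for blank input.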

Features

v3.6.2

26 Nov 12:36

3.6.2 (2023-11-26)

Bug Fixes

  • prevent race condition in KeyValueStore.getAutoSavedValue() (#2193) (e340e2b)
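
One common way to avoid this kind of race is to cache the pending promise rather than the resolved value, so that concurrent callers share a single load. The sketch below illustrates that general idea only; `AutoSavedCache` is a hypothetical class, not Crawlee's `KeyValueStore`, and it is not claimed to match the actual fix in #2193.

```typescript
// Hypothetical cache, not Crawlee's KeyValueStore: it stores the pending
// promise itself so concurrent callers for the same key share one load.
type Stored = Record<string, unknown>;

class AutoSavedCache {
    private readonly pending = new Map<string, Promise<Stored>>();
    public loads = 0;

    getAutoSavedValue(key: string, defaultValue: Stored = {}): Promise<Stored> {
        const existing = this.pending.get(key);
        if (existing) return existing;

        // Register the promise synchronously, before any await, so a second
        // caller arriving in the same tick sees it and does not start a new load.
        const promise = this.load(defaultValue);
        this.pending.set(key, promise);
        return promise;
    }

    private async load(defaultValue: Stored): Promise<Stored> {
        this.loads++;
        await Promise.resolve(); // stands in for an asynchronous storage read
        return defaultValue;
    }
}

const cache = new AutoSavedCache();
const first = cache.getAutoSavedValue('state');
const second = cache.getAutoSavedValue('state');
```

Both calls return the same promise, so every caller ends up with the same object instead of separate copies that could overwrite each other.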

v3.6.1

15 Nov 18:56

3.6.1 (2023-11-15)

Bug Fixes

  • ts: ignore import errors for got-scraping (012fc9e)
  • ts: specify type explicitly for logger (aec3550)

Features

v3.6.0

15 Nov 15:20

3.6.0 (2023-11-15)

Bug Fixes

  • add skipNavigation option to enqueueLinks (#2153) (118515d)
  • BrowserPool: ignore --no-sandbox flag for webkit launcher (#2148) (1eb2f08), closes #1797
  • core: respect some advanced options for RequestList.open() + improve docs (#2158) (c5a1b07)
  • declare missing dependency on got-scraping in the core package (cd2fd4d)
  • provide more detailed error messages for browser launch errors (#2157) (f188ebe)
  • retry incorrect Content-Type when response has blocked status code (#2176) (b54fb8b), closes #1994

Features

v3.5.8

17 Oct 10:20

3.5.8 (2023-10-17)

Bug Fixes

  • MemoryStorage: ignore invalid files for request queues (#2132) (fa58581), closes #1985
  • refactor extractUrls to split the text line by line first (#2122) (7265cd7)
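
The line-by-line approach named in the last item can be sketched as follows. This is an illustration, not the library's implementation: `extractUrlsByLine` is a hypothetical function and `URL_PATTERN` is a deliberately simplified regular expression chosen for the example. One plausible benefit of splitting first is that each regex run only sees a short input.

```typescript
// Simplified URL pattern for illustration only (an assumption, not the
// pattern the library uses).
const URL_PATTERN = /https?:\/\/[^\s"'<>]+/g;

// Hypothetical helper: split the text into lines, then match each line separately.
function extractUrlsByLine(text: string): string[] {
    const urls: string[] = [];
    for (const line of text.split(/\r?\n/)) {
        const matches = line.match(URL_PATTERN);
        if (matches) urls.push(...matches);
    }
    return urls;
}

const sample = [
    'See https://crawlee.dev for docs',
    'no links on this line',
    'https://example.com/a and https://example.com/b',
].join('\n');

const found = extractUrlsByLine(sample);
console.log(found);
```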

v3.5.7

05 Oct 09:03

3.5.7 (2023-10-05)

Bug Fixes

  • add warning when we detect use of RL and RQ, but RQ is not provided explicitly (#2115) (6fb1c55), closes #1773
  • ensure the status message cannot get the crawler stuck (#2114) (9034f08)
  • RQ request count is consistent after migration (#2116) (9ab8c18), closes #1855

v3.5.6

04 Oct 10:31

3.5.6 (2023-10-04)

Bug Fixes

  • types: re-export RequestQueueOptions as an alias to RequestProviderOptions (#2109) (0900f76)

Features

v3.5.5

02 Oct 13:02

3.5.5 (2023-10-02)

Bug Fixes

  • allow using any version of puppeteer or playwright (#2102) (0cafceb), closes #2101
  • session pool leaks memory on multiple crawler runs (#2083) (b96582a), closes #2074 #2031
  • templates: install browsers on postinstall for playwright (#2104) (323768b)
  • types: make return type of RequestProvider.open and RequestQueue(v2).open strict and accurate (#2096) (dfaddb9)

Features

  • experimental support for request locking (Request Queue v2) (#1975) (70a77ee), closes #1365

v3.5.4

11 Sep 13:23

3.5.4 (2023-09-11)

Bug Fixes

  • core: allow explicit calls to purgeDefaultStorage to wipe the storage on each call (#2060) (4831f07)
  • various helpers opening KVS now respect Configuration (#2071) (59dbb16)

Features

  • remove side effect from the deprecated error context augmentation (#2069) (f9fb5c4)