Skip to content

Releases: apify/crawlee

v3.3.0

09 Mar 09:17
Compare
Choose a tag to compare

3.3.0 (2023-03-09)

Bug Fixes

  • add proxyUrl to DownloadListOfUrlsOptions (779be1e), closes #1780
  • CheerioCrawler: pass ixXml down to response parser (#1807) (af7a5c4), closes #1794
  • ignore invalid URLs in enqueueLinks in browser crawlers (#1803) (5ac336c)
  • MemoryStorage: request queues race conditions causing crashes (#1806) (083a9db), closes #1792
  • MemoryStorage: RequestQueue should respect forefront (#1816) (b68e86a), closes #1787
  • MemoryStorage: RequestQueue#handledRequestCount should update (#1817) (a775e4a), closes #1764

Features

v3.2.2

08 Feb 18:53
Compare
Choose a tag to compare

3.2.2 (2023-02-08)

Bug Fixes

  • MemoryStorage: request queues saved in the wrong place (#1779) (19409db)

v3.2.1

07 Feb 11:49
Compare
Choose a tag to compare

3.2.1 (2023-02-07)

Bug Fixes

  • add QueueOperationInfo export to the core package (5ec6c24)

v3.2.0

07 Feb 08:28
Compare
Choose a tag to compare

3.2.0 (2023-02-07)

Bug Fixes

  • allow userData option in enqueueLinksByClickingElements (#1749) (736f85d), closes #1617
  • clone request.userData when creating new request object (#1728) (222ef59), closes #1725
  • Correctly compute pendingRequestCount in request queue (#1765) (946535f)
  • declare missing dependency on tslib (27e96c8), closes #1747
  • ensure CrawlingContext interface is inferred correctly in route handlers (aa84633)
  • KeyValueStore: big buffers should not crash (#1734) (2f682f7), closes #1732 #1710
  • memory-storage: dont fail when storage already purged (#1737) (8694027), closes #1736
  • update playwright to 1.29.2 and make peer dep. less strict (#1735) (c654fcd), closes #1723
  • utils: add missing dependency on ow (bf0e03c), closes #1716

Features

  • add forefront option to all enqueueLinks variants (#1760) (a01459d), closes #1483
  • enqueueLinks: add SameOrigin strategy and relax protocol matching for the other strategies (#1748) (4ba982a)
  • MemoryStorage: read from fs if persistStorage is enabled, ram only otherwise (#1761) (e903980)

v3.1.4

14 Dec 15:45
Compare
Choose a tag to compare

3.1.4 (2022-12-14)

Bug Fixes

v3.1.3

07 Dec 14:26
Compare
Choose a tag to compare

3.1.3 (2022-12-07)

Bug Fixes

Features

  • always show error origin if inside the userland (#1677) (bbe9045)
  • hideInternalConsole in JSDOMCrawler (#1707) (8975f90)

v3.1.2

15 Nov 08:57
Compare
Choose a tag to compare

3.1.2 (2022-11-15)

Bug Fixes

  • injectJQuery in context does not survive navs (#1661) (493a7cf)
  • make router error message more helpful for undefined routes (#1678) (ab359d8)
  • MemoryStorage: correctly respect the desc option (#1666) (b5f37f6)
  • requestHandlerTimeout timing (#1660) (493ea0c)
  • shallow clone browserPoolOptions before normalization (#1665) (22467ca)
  • support headfull mode in playwright js project template (ea2e61b)
  • support headfull mode in puppeteer js project template (e6aceb8)

Features

v3.1.1

07 Nov 13:01
Compare
Choose a tag to compare

3.1.1 (2022-11-07)

Bug Fixes

Features

  • add static set and useStorageClient shortcuts to Configuration (2e66fa2)
  • enable migration testing (#1583) (ee3a68f)
  • playwright: disable animations when taking screenshots (#1601) (4e63034)

v3.1.0

13 Oct 12:14
Compare
Choose a tag to compare

3.1.0 (2022-10-13)

Bug Fixes

  • add overload for KeyValueStore.getValue with defaultValue (#1541) (e3cb509)
  • add retry attempts to methods in CLI (#1588) (9142e59)
  • allow label in enqueueLinksByClickingElements options (#1525) (18b7c25)
  • basic-crawler: handle request.noRetry after errorHandler (#1542) (2a2040e)
  • build storage classes by using this instead of the class (#1596) (2b14eb7)
  • correct some typing exports (#1527) (4a136e5)
  • do not hide stack trace of (retried) Type/Syntax/ReferenceErrors (469b4b5)
  • enqueueLinks: ensure the enqueue strategy is respected alongside user patterns (#1509) (2b0eeed)
  • enqueueLinks: prevent useless request creations when filtering by user patterns (#1510) (cb8fe36)
  • export Cookie from crawlee metapackage (7b02ceb)
  • handle redirect cookies (#1521) (2f7fc7c)
  • http-crawler: do not hang on POST without payload (#1546) (8c87390)
  • remove undeclared dependency on core package from puppeteer utils (827ae60)
  • support TypeScript 4.8 (#1507) (4c3a504)
  • wait for persist state listeners to run when event manager closes (#1481) (aa550ed)

Features

v3.0.4

22 Aug 20:14
Compare
Choose a tag to compare

3.0.4 (2022-08-22)

Features

  • bump puppeteer support to 15.1

Bug Fixes

  • key value stores emitting an error when multiple write promises ran in parallel (#1460) (f201cca)
  • fix dockerfiles in project templates