Skip to content

Releases: webrecorder/browsertrix-crawler

Browsertrix Crawler v1.2.4

12 Jul 02:55
Compare
Choose a tag to compare

What's Changed

  • Loosen selectors for login fields in automated profile creation by @tw4l in #638
  • bump replayweb.page to 2.1.1 by @ikreymer in #640
  • don't disable extraHops when using sitemaps: by @ikreymer in #639
  • bump browsertrix-behaviors to 0.6.2

Full Changelog: v1.2.3...v1.2.4

Browsertrix Crawler v1.2.3

09 Jul 18:05
Compare
Choose a tag to compare

What's Changed

  • Improved handling of pages that redirect back to the same page by @ikreymer in #635
  • Fix pending requests causing timeout by @ikreymer in #636

Full Changelog: v1.2.2...v1.2.3

Browsertrix Crawler v1.2.2

03 Jul 20:00
302b119
Compare
Choose a tag to compare

What's Changed

  • tests: reduce logging by @ikreymer in #596
  • Dependency Update / 1.2.2 by @ikreymer in #633
  • Bump Brave to 1.67.123, update puppeteer-core to latest
  • Tests: simplifies extra hops test to avoid complex pages that could lead to timeout

Full Changelog: v1.2.1...v1.2.2

Browsertrix Crawler v1.2.1

26 Jun 16:18
4495532
Compare
Choose a tag to compare

What's Changed

  • browser policies: disable restoring any tabs on startup + set new tab URL to about:blank by @ikreymer in #626
  • Remove DISPLAY env var from image by @ikreymer in #625
  • Don't filter saving redirect if no response body. by @ikreymer in #628
  • Always download PDF + non HTML page cleanup + enterprise policy cleanup by @ikreymer in #629

Full Changelog: v1.2.0...v1.2.1

Browsertrix Crawler v1.2.0

21 Jun 23:35
8af8b3c
Compare
Choose a tag to compare

What's Changed

  • Bump version to 1.2.0 Beta + make draft release for each commit by @ikreymer in #582
  • Always add warcinfo records to all WARCs by @ikreymer in #556
  • Load non-HTML resources directly whenever possible by @ikreymer in #583
  • base image version bump to brave 1.66.115 by @ikreymer in #592
  • Add group policies, limit browser access to container filesystem by @vnznznz in #579
  • cleanup dockerfile + fix test by @ikreymer in #595
  • Consider disk usage of collDir instead of default /crawls by @benoit74 in #586
  • add --dryRun flag and mode by @ikreymer in #594
  • proxy: support setting proxy via --proxyServer, PROXY_SERVER env var or PROXY_HOST + PROXY_PORT env vars by @ikreymer in #589
  • merge 1.1.4 -> 1.2.0 beta by @ikreymer in #611
  • add EXPOSE for ports used inside container by @ikreymer in #612
  • adjust browser viewport to avoid cutting off bottom of page by @ikreymer in #614
  • clearer scope check by @ikreymer in #615
  • http auth support per seed (supersedes #566): by @ikreymer in #616
  • logging: log error message when seed is failed to be created by @ikreymer in #619
  • add yarn.lock to Docker to ensure consistent builds! by @ikreymer in #621
  • disable socat by default by @ikreymer in #622
  • bump brave to 1.67.119 by @ikreymer in #620
  • Updated rewriting for YouTube + dependency update by @ikreymer in #623
  • 1.2.0 release - deps: bump wabac.js to 2.19.1, RWP for QA to 2.1.0 by @ikreymer in #624

Full Changelog: v1.1.4...v1.2.0

Browsertrix Crawler v1.2.0-beta.3

21 Jun 22:04
65a8635
Compare
Choose a tag to compare
Pre-release

What's Changed

  • Updated rewriting for YouTube + Instagram, dependency update by @ikreymer in #623

Full Changelog: v1.2.0-beta.2...v1.2.0-beta.3

Browsertrix Crawler v1.2.0-beta.2

21 Jun 03:12
Compare
Choose a tag to compare
Pre-release

What's Changed

Full Changelog: v1.2.0-beta.1...v1.2.0-beta.2

Browsertrix Crawler v1.2.0-beta.1

14 Jun 22:26
ac722cc
Compare
Choose a tag to compare
Pre-release

What's Changed

Full Changelog: v1.2.0-beta.0...v1.2.0-beta.1

Browsertrix Crawler v1.1.4

14 Jun 02:16
9094a83
Compare
Choose a tag to compare

What's Changed

Full Changelog: v1.1.3...v1.1.4

Browsertrix Crawler v1.2.0-beta.0

10 Jun 20:19
e2b4cc1
Compare
Choose a tag to compare
Pre-release

What's Changed

  • Bump version to 1.2.0 Beta + make draft release for each commit by @ikreymer in #582
  • Always add warcinfo records to all WARCs by @ikreymer in #556
  • Load non-HTML resources directly whenever possible by @ikreymer in #583
  • base image version bump to brave 1.66.115 by @ikreymer in #592
  • Add group policies, limit browser access to container filesystem by @vnznznz in #579
  • cleanup dockerfile + fix test by @ikreymer in #595
  • Consider disk usage of collDir instead of default /crawls by @benoit74 in #586
  • add --dryRun flag and mode by @ikreymer in #594
  • proxy: support setting proxy via --proxyServer, PROXY_SERVER env var or PROXY_HOST + PROXY_PORT env vars by @ikreymer in #589

Full Changelog: v1.1.3...v1.2.0-beta.0