Releases: webrecorder/browsertrix-crawler
Browsertrix Crawler v1.2.4
Browsertrix Crawler v1.2.3
What's Changed
- Improved handling of pages that redirect back to the same page by @ikreymer in #635
- Fix pending requests causing timeout by @ikreymer in #636
Full Changelog: v1.2.2...v1.2.3
Browsertrix Crawler v1.2.2
What's Changed
- tests: reduce logging by @ikreymer in #596
- Dependency Update / 1.2.2 by @ikreymer in #633
  - Bump Brave to 1.67.123, update puppeteer-core to latest
  - Tests: simplifies extra hops test to avoid complex pages that could lead to timeout
Full Changelog: v1.2.1...v1.2.2
Browsertrix Crawler v1.2.1
What's Changed
- browser policies: disable restoring any tabs on startup + set new tab URL to about:blank by @ikreymer in #626
- Remove DISPLAY env var from image by @ikreymer in #625
- Don't filter saving redirect if no response body by @ikreymer in #628
- Always download PDF + non HTML page cleanup + enterprise policy cleanup by @ikreymer in #629
Full Changelog: v1.2.0...v1.2.1
Browsertrix Crawler v1.2.0
What's Changed
- Bump version to 1.2.0 Beta + make draft release for each commit by @ikreymer in #582
- Always add warcinfo records to all WARCs by @ikreymer in #556
- Load non-HTML resources directly whenever possible by @ikreymer in #583
- base image version bump to brave 1.66.115 by @ikreymer in #592
- Add group policies, limit browser access to container filesystem by @vnznznz in #579
- cleanup dockerfile + fix test by @ikreymer in #595
- Consider disk usage of collDir instead of default /crawls by @benoit74 in #586
- add --dryRun flag and mode by @ikreymer in #594
- proxy: support setting proxy via --proxyServer, PROXY_SERVER env var or PROXY_HOST + PROXY_PORT env vars by @ikreymer in #589
- merge 1.1.4 -> 1.2.0 beta by @ikreymer in #611
- add EXPOSE for ports used inside container by @ikreymer in #612
- adjust browser viewport to avoid cutting off bottom of page by @ikreymer in #614
- clearer scope check by @ikreymer in #615
- http auth support per seed (supersedes #566) by @ikreymer in #616
- logging: log error message when a seed fails to be created by @ikreymer in #619
- add yarn.lock to Docker to ensure consistent builds! by @ikreymer in #621
- disable socat by default by @ikreymer in #622
- bump brave to 1.67.119 by @ikreymer in #620
- Updated rewriting for YouTube + dependency update by @ikreymer in #623
- 1.2.0 release - deps: bump wabac.js to 2.19.1, RWP for QA to 2.1.0 by @ikreymer in #624
Full Changelog: v1.1.4...v1.2.0
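The proxy entry in this release (#589) names three ways to supply a proxy: the `--proxyServer` flag, the `PROXY_SERVER` env var, or the `PROXY_HOST` + `PROXY_PORT` env vars. The sketch below shows one way such settings could be resolved into a single proxy URL. It is not the crawler's actual code: the `resolveProxy` helper, the precedence order (flag, then `PROXY_SERVER`, then host + port), and the `http://` scheme used for the host + port form are all assumptions made for illustration.

```typescript
// Hypothetical helper, not part of browsertrix-crawler itself.
type Env = Record<string, string | undefined>;

function resolveProxy(
  cliProxyServer: string | undefined,
  env: Env,
): string | undefined {
  // Assumption: an explicit --proxyServer value takes priority.
  if (cliProxyServer) {
    return cliProxyServer;
  }
  // Assumption: PROXY_SERVER is checked next and used as-is.
  if (env.PROXY_SERVER) {
    return env.PROXY_SERVER;
  }
  // Assumption: PROXY_HOST and PROXY_PORT must both be set,
  // and are combined into an http:// URL.
  if (env.PROXY_HOST && env.PROXY_PORT) {
    return `http://${env.PROXY_HOST}:${env.PROXY_PORT}`;
  }
  // No proxy configured.
  return undefined;
}
```

In a Node.js program, a call might look like `resolveProxy(undefined, process.env)`. Passing the environment in as a parameter, rather than reading `process.env` inside the helper, keeps the sketch easy to check in isolation.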
Browsertrix Crawler v1.2.0-beta.3
What's Changed
Full Changelog: v1.2.0-beta.2...v1.2.0-beta.3
Browsertrix Crawler v1.2.0-beta.2
What's Changed
- clearer scope check by @ikreymer in #615
- http auth support per seed (supersedes #566) by @ikreymer in #616
- logging: log error message when a seed fails to be created by @ikreymer in #619
- add yarn.lock to Docker to ensure consistent builds! by @ikreymer in #621
- disable socat by default by @ikreymer in #622
- bump brave to 1.67.119 by @ikreymer in #620
Full Changelog: v1.2.0-beta.1...v1.2.0-beta.2
Browsertrix Crawler v1.2.0-beta.1
What's Changed
- tests: fix blockrules tests by @ikreymer in #603
- recorder: add missing shouldSkip() to responseReceived callback by @ikreymer in #602
- Change some logged errors to warns by @tw4l in #600
- Fix synching extraSeeds state with multiple crawler instances by @ikreymer in #605
- dependency: update RWP to 2.0.1 by @ikreymer in #610
- Fix header newline escape by @ikreymer in #609
- merge 1.1.4 -> 1.2.0 beta by @ikreymer in #611
- add EXPOSE for ports used inside container by @ikreymer in #612
- adjust browser viewport to avoid cutting off bottom of page by @ikreymer in #614
Full Changelog: v1.2.0-beta.0...v1.2.0-beta.1
Browsertrix Crawler v1.1.4
What's Changed
- tests: fix blockrules tests by @ikreymer in #603
- recorder: add missing shouldSkip() to responseReceived callback by @ikreymer in #602
- Change some logged errors to warns by @tw4l in #600
- Fix synching extraSeeds state with multiple crawler instances by @ikreymer in #605
- dependency: update RWP to 2.0.1 by @ikreymer in #610
- add undici for 1.1.4 release, to fix #606 by @ikreymer in #608
- Fix header newline escape by @ikreymer in #609
Full Changelog: v1.1.3...v1.1.4
Browsertrix Crawler v1.2.0-beta.0
What's Changed
- Bump version to 1.2.0 Beta + make draft release for each commit by @ikreymer in #582
- Always add warcinfo records to all WARCs by @ikreymer in #556
- Load non-HTML resources directly whenever possible by @ikreymer in #583
- base image version bump to brave 1.66.115 by @ikreymer in #592
- Add group policies, limit browser access to container filesystem by @vnznznz in #579
- cleanup dockerfile + fix test by @ikreymer in #595
- Consider disk usage of collDir instead of default /crawls by @benoit74 in #586
- add --dryRun flag and mode by @ikreymer in #594
- proxy: support setting proxy via --proxyServer, PROXY_SERVER env var or PROXY_HOST + PROXY_PORT env vars by @ikreymer in #589
Full Changelog: v1.1.3...v1.2.0-beta.0