Use new browser-based archiving mechanism instead of pywb proxy #424
Merged
75 commits
7524688 recorder work! (ikreymer)
af95ad9 remove dep (ikreymer)
ccb5549 fix (ikreymer)
5e6a9d2 rewriting work, wait for requests to finish (ikreymer)
31e2371 tweaks, attempt to determine issues on local build (ikreymer)
f0e648a work (ikreymer)
288d2cd logging, skip 206 (ikreymer)
d10a7e8 add concurrent (ikreymer)
02c9755 move recording to recorder (ikreymer)
c8d2ffa tweak logging (ikreymer)
380da7f logging improvements (ikreymer)
6a610aa refactor: also track Network events to get security details, wait for… (ikreymer)
86117b6 use brave image (ikreymer)
32b18e2 keep response data (ikreymer)
ee5804e large files: add streaming to tmp dir in current collection (ikreymer)
4a94c1d stream WARC writing, fix dedup (ikreymer)
b7bb59b logging: group network-related logging into separate call which can b… (ikreymer)
53adc94 add separate async fetch handler separate from browser response strea… (ikreymer)
ba57a0c async fetch work, check for empty response (ikreymer)
ba58c0d pending reset: (ikreymer)
8fe343d streaming fix: (ikreymer)
dd07fc6 Merge branch 'unmark-pending-on-restart' into recorder-work (ikreymer)
c88ff5e lower concurrency, add support for takeResponseBodyAsStream vs fetch,… (ikreymer)
270c52c update extraOpts, set max in mem to 10MB (ikreymer)
e4d5e54 fix --generateCDX to fix tests (ikreymer)
7a387a9 fix streaming logic! (ikreymer)
c5f6fff Merge branch 'main' into recorder-work (ikreymer)
dfa86ae refactor into AsyncFetcher and ResponseStreamAsyncFetcher (ikreymer)
df8fbff recorder: init dirs on load, init file on use (ikreymer)
af39d40 don't store partial records, always remove after async fetch (ikreymer)
7d392e9 ensure filehandle is inited (ikreymer)
b010407 Merge branch 'main' into recorder-work (ikreymer)
7af413e deps: update to latest warcio.js serializer branch (ikreymer)
becc195 warcwriter: move writing to warcwriter (ikreymer)
f6e3551 add writeCdx (ikreymer)
0f9ba83 Merge branch 'main' into recorder-work (ikreymer)
3aad61a Merge branch 'main' into recorder-work (ikreymer)
361f765 Merge branch 'main' into recorder-work (ikreymer)
a359f3a recorder: attempt to ensure serviceworkers are also captured: (ikreymer)
134695b browser: switch to latest chrome (ikreymer)
40cba63 tweaks: (ikreymer)
0ef763c refactor: (ikreymer)
6951fc0 disable expected size check if content-encoding is present! (ikreymer)
107ac23 headers: ensure content-encoding and transfer-encoding are rewritten … (ikreymer)
e332ee4 Merge branch 'main' into recorder-work (ikreymer)
cf53a51 Merge branch 'main' into recorder-work (ikreymer)
ed127a9 update to latest warcio with stream-serializer (ikreymer)
2420896 Merge branch 'main' into recorder-work (ikreymer)
147a13b update to warcio 2.2.0! (ikreymer)
123762f fix for warcio update: buffer takestream if read in memory for rewriting (ikreymer)
b35a33d Merge branch 'main' into recorder-work (ikreymer)
0985f96 recorder fixes: (ikreymer)
5b7d46c fix log msg (ikreymer)
b924ae1 fix typo in asyncLoad check! (ikreymer)
60aec17 Merge branch 'main' into recorder-work (ikreymer)
759f950 fix redirect handling: (ikreymer)
529a3cd error handling: better error detection for loadNetworkRespource() path (ikreymer)
0925c3c tweak error message (ikreymer)
7b0de11 Merge branch 'main' into recorder-work (ikreymer)
7415ac1 update yarn.lock (ikreymer)
2e76fb4 revert to older version of puppeteer-core due to changes in accessing… (ikreymer)
9f43f3c fix header access (ikreymer)
a2b4f8b add url to shouldSkip check, only include http/https URLs (ikreymer)
bc30d5a Merge branch 'main' into recorder-work (ikreymer)
6f07377 Merge branch 'main' to 'recorder-work', switching to Brave (ikreymer)
6e9a1be state: add pending-wait state when waiting for crawl to finish (ikreymer)
0a1e6df service worker handling fixes: (ikreymer)
60cf313 reenable HEAD check + direct (non-browser) fetch of non-HTML pages: (ikreymer)
ccff712 Merge branch 'main' into recorder-work (ikreymer)
53cfd39 Merge branch 'main' (0.12.1 release) into recorder-work (ikreymer)
e7a850c Apply suggestions from code review, remove commented out code (ikreymer)
988bf7a remove unused code, remove references to pywb (ikreymer)
034de9a fix warcinfo test after version update (ikreymer)
468a009 logging: reenable logging for timed out pending requests for now (ikreymer)
868cd7a remove pywb dependency (ikreymer)
Will want to remove `pywb` as an option in the description for `--logging` in argParser.