dict.get is not a function #7303

danlester · 2016-05-09T10:11:01Z

There is a problem rendering this PDF:
ws_protectyourwork_e.pdf

Using the standard web viewer example (build/generic/web/viewer.html) it runs into an exception, showing the error:

PDF.js v1.5.232 (build: a682cce)
Message: dict.get is not a function

This happens in Chrome, Firefox, Safari on the Mac - haven't tried any others. Message on Safari is: dict.get is not a function. (In 'dict.get('Filter')', 'dict.get' is undefined)

I hope this helps. Not sure if there is a problem with the PDF but I guess it would ideally catch the error anyway. PDF seems to display OK in other software.

Thanks,

Dan

The text was updated successfully, but these errors were encountered:

Snuffleupagus · 2016-05-09T10:34:09Z

This is a unfortunately a regression from PR #5910.

It seems to me that we need a _much_ more robust way of trying to recover valid XRef data from concatenated PDF files, rather than just relying on a simple condition[1]. (I'm actually a little bit surprised that this hasn't caused more issues in practice.)

[1]

pdf.js/src/core/obj.js

Line 928 in d5c0008

if (typeof this.entries[m[1]] === 'undefined') {

yurydelendik · 2016-05-09T14:22:50Z

rather than just relying on a simple condition

@Snuffleupagus does it miss generation check? or shall we track the latest obj with specific number instead of first?

yurydelendik · 2016-05-09T14:25:44Z

PDF seems to display OK in other software.

Adobe Reader asking to re-save the opened PDF, this means PDF was corrupted and the Reader recovered it.

Snuffleupagus · 2016-05-09T14:33:46Z

does it miss generation check?

I don't think so, since off the top of my head all entries have gen === 0.
The problem here is that the PDF file is actually two separate PDF files placed in just one file, i.e. a completely busted PDF file (in the eyes of the specification).

or shall we track the latest obj with specific number instead of first?

I'm not sure how we can solve this in general, since in this case there are e.g. two distinct 76 0 obj entires, one in the "first" part of the file and one in the "second" part.

yurydelendik · 2016-05-09T14:53:19Z

Best solution will be to determine what the Reader does, I guess. However we need to understand how the file was created and intent of the generator. @danlester can you provide history of the PDF?

yurydelendik · 2016-05-09T15:02:31Z

I'm not sure how we can solve this in general, since in this case there are e.g. two distinct 76 0 obj entires, one from the "first" file and one from the "second one.

We shall take the one that is placed before the trailer that had catalog object reference. This means we shall not commit to found objects until next trailer is found (if not found it at all we just use what we found)

danlester · 2016-05-25T09:47:27Z

Sorry I missed your notification. I'm afraid I don't know much about the PDF history anyway - it wasn't my file originally. Will see if I can find anything out, but probably not.

Snuffleupagus added core pdf-broken regression labels May 9, 2016

yurydelendik added corrupted-pdf and removed pdf-broken labels May 9, 2016

Snuffleupagus mentioned this issue Oct 13, 2019

Allow over-writing entries, in XRef.indexObjects, only when the generation number matches (issues 11230, 11139, 9552, 9129, 7303) #11231

Merged

3 tasks

timvandermeij closed this as completed in #11231 Oct 17, 2019

timvandermeij removed core regression labels Oct 17, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

dict.get is not a function #7303

dict.get is not a function #7303

danlester commented May 9, 2016

Snuffleupagus commented May 9, 2016 •

edited

Loading

yurydelendik commented May 9, 2016

yurydelendik commented May 9, 2016

Snuffleupagus commented May 9, 2016 •

edited

Loading

yurydelendik commented May 9, 2016

yurydelendik commented May 9, 2016 •

edited

Loading

danlester commented May 25, 2016

dict.get is not a function #7303

dict.get is not a function #7303

Comments

danlester commented May 9, 2016

Snuffleupagus commented May 9, 2016 • edited Loading

yurydelendik commented May 9, 2016

yurydelendik commented May 9, 2016

Snuffleupagus commented May 9, 2016 • edited Loading

yurydelendik commented May 9, 2016

yurydelendik commented May 9, 2016 • edited Loading

danlester commented May 25, 2016

Snuffleupagus commented May 9, 2016 •

edited

Loading

Snuffleupagus commented May 9, 2016 •

edited

Loading

yurydelendik commented May 9, 2016 •

edited

Loading