Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Edited pages rendered as if not changed #9129

Closed
nerkn opened this issue Nov 14, 2017 · 5 comments · Fixed by #11231
Closed

Edited pages rendered as if not changed #9129

nerkn opened this issue Nov 14, 2017 · 5 comments · Fixed by #11231

Comments

@nerkn
Copy link

nerkn commented Nov 14, 2017

Attach (recommended) or Link to PDF file here:
https://zenodo.org/record/1048152#.WgrUmd9fjeQ
or
https://waset.org/publications/10006645/red

Configuration:

  • Web browser and its version:
    mozilla ( under linux/windows),
  • Operating system and its version:
  • PDF.js version:
  • Is a browser extension:

Steps to reproduce the problem:
0. Our editors format papers, and our system using zend php, adding necessary information to header, footer, side notes.

  1. Open the document, there is no header, footer, side notes

What is the expected behavior? (add screenshot)
screencapture-waset-org-publications-10006645-r-1510672765952

What went wrong? (add screenshot)
screenshot-2017-11-14 microsoft word - 17ae030135 3 - red

Link to a viewer (if hosted on a site other than mozilla.github.io/pdf.js or as Firefox/Chrome extension):
expected:
https://waset.org/publication/Red/10006645
not working :
https://zenodo.org/record/1048152#.WgrUmd9fjeQ

@timvandermeij
Copy link
Contributor

timvandermeij commented Nov 14, 2017

The version of PDF.js hosted there is really old: 1.4.192. However, it also happens in the latest version where it gives some warnings:

Warning: Indexing all PDF objects  pdf.worker.js:342:5
PDF ff76551b2233684c9360d1a6b7172103 [1.6 Acrobat Distiller 7.0 (Windows) / PScript5.dll Version 5.2.2] (PDF.js: 2.0.114)  app.js:1083:6
downloadable font: OS/2: Adjusting head.macStyle (italic) to match fsSelection (font-family: "g_d1_f5" style:normal weight:normal stretch:normal src index:0) source: data:font/opentype;base64,AAEAAAANAIAAAwBQT1MvMqAsjSkAAADcAAAAVmNtYXBmcCauAAABNAAAAKxjdnQg/0SilwAAAeAAAAd4ZnBnbQFau5IAAAlYAAAFzW … dHUrKysrK0UYaUQrACtzKysrKysrKysrASsrKwBzc3MrKysBQ1xYuP/esx4cMxi4A3+1KzMYJiszuf/wA4GxEjMrKysrWQFzdCsAK3N1KytzKysrKytzdSsrKysrAA==  viewer.html
downloadable font: OS/2: Adjusting head.macStyle (italic) to match fsSelection (font-family: "g_d1_f8" style:normal weight:normal stretch:normal src index:0) source: data:font/opentype;base64,AAEAAAANAIAAAwBQT1MvMqAsjSkAAADcAAAAVmNtYXDLXhY1AAABNAAAAGRjdnQg/0SilwAAAZgAAAd4ZnBnbQFau5IAAAkQAAAFzW … Kyt0dSsrKysrRRhpRCsAK3MrKysrKysrKysBKysrAHNzcysrKwFDXFi4/96zHhwzGLgDf7UrMxgmKzO5//ADgbESMysrKytZAXN0KwArc3UrK3MrKysrK3N1KysrKysA

@Snuffleupagus
Copy link
Collaborator

Snuffleupagus commented Nov 15, 2017

I cannot reproduce the sanitizer errors in #9129 (comment), but the real problem with the PDF file is that it seems to be edited in a way that renders the resulting file corrupt (a good indication of this is the Warning: Indexing all PDF objects console warning).
From a quick look it appears that the PDF "file" is actually multiple PDF documents placed in just one file, which is a mode of corruption that can be difficult to recover from.

@nerkn
Copy link
Author

nerkn commented Dec 25, 2017

which libraries do you recommend to edit pdf files? Pyton, php, js in any language?

@timvandermeij
Copy link
Contributor

The sanitizer warnings are now gone, but the missing text problem remains.

@timvandermeij
Copy link
Contributor

Closing as a duplicate of #9552 since that one contains more information and a fixed PDF file for comparison.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
3 participants