
Bug with error TypeError: 'PDFObjRef' object is not iterable #1082

Open
pprados opened this issue Feb 25, 2025 · 2 comments


pprados commented Feb 25, 2025

Bug report

With some PDF files, the code crashes with TypeError: 'PDFObjRef' object is not iterable.

I propose a PR here
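
For readers unfamiliar with this class of error, here is a minimal sketch of how it can arise and be avoided. It rests on an assumption: that a value expected to be a list is in fact an indirect object reference that was never resolved. `FakeObjRef`, `resolve_all`, `count_items_unsafe` and `count_items_safe` are hypothetical stand-ins written for this illustration; they are not pdfminer code and not the content of the proposed PR. pdfminer.six ships a comparable helper, `resolve1`, in `pdfminer.pdftypes`.

```python
class FakeObjRef:
    """Hypothetical stand-in for an indirect PDF object reference."""

    def __init__(self, target):
        self._target = target

    def resolve(self):
        # Return the object this reference points to (possibly another reference).
        return self._target


def resolve_all(value):
    """Follow references until a direct (non-reference) value is reached."""
    while isinstance(value, FakeObjRef):
        value = value.resolve()
    return value


def count_items_unsafe(value):
    # Iterates directly; raises TypeError when value is still a reference.
    return len(list(value))


def count_items_safe(value):
    # Resolves the reference first, then iterates over the real list.
    return len(list(resolve_all(value)))


# A list hidden behind two levels of indirection.
ref = FakeObjRef(FakeObjRef([1, 2, 3]))

try:
    count_items_unsafe(ref)
except TypeError as exc:
    print("unsafe:", exc)

print("safe:", count_items_safe(ref))
```

Whether the crash reported here has exactly this cause depends on the PDF and on the code path involved, so treat the sketch as an illustration of the pattern only.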


pprados commented Feb 26, 2025

@pietermarsman or @euske

Can you integrate this PR and publish a new version?

pdfplumber crashes because of this. As a result, it is not possible to use pdfminer or pdfplumber with LangChain, the leading LLM framework, and pdfplumber has to force the use of an old version.

I want to help standardize the various parsers, but that's not possible without fixing this bug.

@bb-at-ss

Yup. I'm still getting this bug too on Version: 20240706

TypeError: 'PDFObjRef' object is not iterable.
