Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Include text which is within figures #98

Closed
jstockwin opened this issue Jun 23, 2020 · 0 comments · Fixed by #99
Closed

Include text which is within figures #98

jstockwin opened this issue Jun 23, 2020 · 0 comments · Fixed by #99

Comments

@jstockwin
Copy link
Owner

PDFMiner.six has a layout parameter, all_texts, which, if set to True, will also perform layout analysis on text within figures.

Doing this in py pdf parser does nothing, since we only look at text boxes. We should also include text from figures when all_texts=True.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant