We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
PDFMiner.six has a layout parameter, all_texts, which, if set to True, will also perform layout analysis on text within figures.
all_texts
True
Doing this in py pdf parser does nothing, since we only look at text boxes. We should also include text from figures when all_texts=True.
all_texts=True
The text was updated successfully, but these errors were encountered:
[loaders] Include text from figures when all_texts=True
026f849
Closes #98
b007258
14dbd7d
jstockwin
Successfully merging a pull request may close this issue.
PDFMiner.six has a layout parameter,
all_texts
, which, if set toTrue
, will also perform layout analysis on text within figures.Doing this in py pdf parser does nothing, since we only look at text boxes. We should also include text from figures when
all_texts=True
.The text was updated successfully, but these errors were encountered: