Sync with fork #513

Closed
537 commits
60ac6ae
Merge pull request #161 from py-pdf/dependabot/pip/rich-13.9.2
dependabot[bot] Oct 8, 2024
0b3e495
Merge pull request #160 from py-pdf/dependabot/github_actions/pypa/gh…
dependabot[bot] Oct 8, 2024
d699f74
Bump nox from 2024.4.15 to 2024.10.9 in /.github/workflows
dependabot[bot] Oct 10, 2024
0fc12ee
Bump distlib from 0.3.8 to 0.3.9
dependabot[bot] Oct 10, 2024
ff597a9
Bump charset-normalizer from 3.3.2 to 3.4.0
dependabot[bot] Oct 10, 2024
1ea386d
Merge pull request #172 from py-pdf/dependabot/pip/charset-normalizer…
dependabot[bot] Oct 10, 2024
21d7637
Merge pull request #170 from py-pdf/dependabot/pip/dot-github/workflo…
dependabot[bot] Oct 10, 2024
b57e488
Merge pull request #171 from py-pdf/dependabot/pip/distlib-0.3.9
dependabot[bot] Oct 10, 2024
8959f15
Bump sphinx from 8.0.2 to 8.1.0 in /docs
dependabot[bot] Oct 11, 2024
710f700
Merge pull request #173 from py-pdf/dependabot/pip/docs/sphinx-8.1.0
dependabot[bot] Oct 11, 2024
dfa29d0
Silence S311 Error
bosd Oct 12, 2024
cd064d5
Bump sphinx from 8.1.0 to 8.1.3 in /docs
dependabot[bot] Oct 14, 2024
a07afa6
Merge pull request #180 from py-pdf/dependabot/pip/docs/sphinx-8.1.3
dependabot[bot] Oct 14, 2024
aa61d3a
[FIX] Image conversion Fallback
bosd Oct 13, 2024
4b86f60
[IMP] Add conversion exception message to feedback
bosd Oct 13, 2024
2f72d94
Fix Flake8 warning on test_invalid_url
bosd Oct 14, 2024
84692c8
Rebrand Image based error message
bosd Oct 8, 2024
4c6720e
[REF] np.isclose to math.isclose
bosd Oct 8, 2024
51e7c2a
[IMP] CLI Documentation
bosd Oct 14, 2024
59e21b1
New test for matplotlib importerror
Jul 19, 2021
e388225
Pre-commit fixes
bosd Oct 14, 2024
e088458
Bump mypy from 1.11.2 to 1.12.0
dependabot[bot] Oct 15, 2024
ffaec76
Merge pull request #186 from py-pdf/dependabot/pip/mypy-1.12.0
dependabot[bot] Oct 15, 2024
09b1361
[REF] Compute accuracy
bosd Oct 12, 2024
bab7a03
Bump setuptools from 75.1.0 to 75.2.0
dependabot[bot] Oct 17, 2024
44d725c
Merge pull request #188 from py-pdf/dependabot/pip/setuptools-75.2.0
dependabot[bot] Oct 17, 2024
501b907
[REF] silence download_url S310,add typing
bosd Oct 12, 2024
c5f3020
Activate mypy setting and run on gh actions
bosd Oct 17, 2024
dee507b
Add typing_extensions for <3.10 compatability
bosd Oct 19, 2024
472fa1e
Typing fixes in core.py
bosd Oct 18, 2024
7345162
Typing fixes in handlers.py
bosd Oct 18, 2024
90b48d4
Typing fixes in utils.py
bosd Oct 18, 2024
62b6cf3
[FIX] Warning on test cli quiet
bosd Oct 17, 2024
4310cf3
Update Lockfile
bosd Oct 19, 2024
d6a8c2d
[REF] split_textline
bosd Oct 12, 2024
25f08cc
[REF] _group_and_process_chars
bosd Oct 19, 2024
d444719
[REF] get_index_closest_point
bosd Oct 12, 2024
e9e4dbe
Fix S310 Audit url open for permitted schemes.
bosd Oct 19, 2024
a54a500
[REF] Scale Image
bosd Oct 19, 2024
cecc283
[REF] Flag_fontsize
bosd Oct 19, 2024
ff9a501
Fix test_image_warning test
bosd Oct 19, 2024
68695b2
[REF] Fix B028
bosd Oct 20, 2024
d2102eb
[REF] lattice -reduce_index
bosd Oct 20, 2024
6e10d91
[REF]: copy_spanning_text
bosd Oct 20, 2024
c4659ca
[REF]: get_table_index
bosd Oct 20, 2024
67a8c22
[REF]: Find_closest_tls
bosd Oct 20, 2024
df513ef
[REF]: get_table_areas
bosd Oct 20, 2024
bc3f9af
[REF] search_table_body
bosd Oct 19, 2024
4451fa9
Pre-commit fix
bosd Oct 20, 2024
bcf0e44
Bump cryptography from 43.0.1 to 43.0.3
dependabot[bot] Oct 21, 2024
336c2d4
Bump virtualenv from 20.26.6 to 20.27.0 in /.github/workflows
dependabot[bot] Oct 21, 2024
3b2471b
Merge pull request #208 from py-pdf/dependabot/pip/cryptography-43.0.3
dependabot[bot] Oct 21, 2024
242039a
Bump mypy from 1.12.0 to 1.12.1
dependabot[bot] Oct 21, 2024
bbdc3c3
Merge pull request #209 from py-pdf/dependabot/pip/dot-github/workflo…
dependabot[bot] Oct 21, 2024
75712ff
Merge pull request #207 from py-pdf/dependabot/pip/mypy-1.12.1
dependabot[bot] Oct 21, 2024
c59c868
Bump virtualenv from 20.26.6 to 20.27.0
dependabot[bot] Oct 21, 2024
76c8c65
Merge pull request #206 from py-pdf/dependabot/pip/virtualenv-20.27.0
dependabot[bot] Oct 21, 2024
3d83b22
Flake8 fixes base parser
bosd Oct 20, 2024
eeee3fe
Bump pydata-sphinx-theme from 0.15.4 to 0.16.0 in /docs
dependabot[bot] Oct 23, 2024
ff1f71e
Bump rich from 13.9.2 to 13.9.3
dependabot[bot] Oct 23, 2024
f3b7ced
Bump mypy from 1.12.1 to 1.13.0
dependabot[bot] Oct 23, 2024
38bf3c9
Merge pull request #214 from py-pdf/dependabot/pip/mypy-1.13.0
dependabot[bot] Oct 23, 2024
7db26f8
Merge pull request #212 from py-pdf/dependabot/pip/docs/pydata-sphinx…
dependabot[bot] Oct 23, 2024
e8c74dd
Merge pull request #213 from py-pdf/dependabot/pip/rich-13.9.3
dependabot[bot] Oct 23, 2024
3d48f20
Bump safety-schemas from 0.0.5 to 0.0.7
dependabot[bot] Oct 24, 2024
73bd23b
Merge pull request #215 from py-pdf/dependabot/pip/safety-schemas-0.0.7
dependabot[bot] Oct 24, 2024
ea83589
[REF]: find lines
bosd Oct 20, 2024
5b44b78
[REF]: Remove old opencv api compatability
bosd Oct 21, 2024
e1b16b0
[REF] Search header, closest_above
bosd Oct 23, 2024
64a83c9
[REF]: Network parser search header -> merge_zones and extract_zones
bosd Oct 24, 2024
1c18a4f
[REF]: network parser generate_table_bbox -> split into mark_processe…
bosd Oct 22, 2024
efa7f43
Bump safety-schemas from 0.0.7 to 0.0.8
dependabot[bot] Oct 25, 2024
71d1e61
Bump safety from 3.2.8 to 3.2.9
dependabot[bot] Oct 25, 2024
1f71b07
Updated pypdf dependency to include v5
snanda85 Oct 25, 2024
fa4b7d9
Fixes custom backend functionality
snanda85 Oct 25, 2024
b60bebe
Fixed typeguard typecheck
snanda85 Oct 25, 2024
bb6ef73
pre-commit Fixups
bosd Oct 26, 2024
a29a8cd
add docstring to get_backend
bosd Oct 26, 2024
883aec0
Merge pull request #218 from py-pdf/dependabot/pip/safety-schemas-0.0.8
dependabot[bot] Oct 26, 2024
1fa918f
Merge pull request #220 from py-pdf/dependabot/pip/safety-3.2.9
dependabot[bot] Oct 26, 2024
21d2052
Network parser Fix B903
bosd Oct 24, 2024
c0f54d9
Bump pydantic from 2.5.3 to 2.9.2
dependabot[bot] Oct 26, 2024
9b1d9af
Merge pull request #219 from py-pdf/dependabot/pip/pydantic-2.9.2
dependabot[bot] Oct 26, 2024
940e0cc
[REF]: Table set edges
bosd Oct 24, 2024
e003fef
[FIX] update edges index out of range
bosd Oct 26, 2024
47803ef
Simplify and fix `Table.set_span`
tomprogrammer Aug 16, 2024
c2e2d77
Change `cell.hspan` and `cell.vspan` to properties
tomprogrammer Aug 17, 2024
a7d1d44
Remove redundant conditions on `Cell.vspan/hspan`
tomprogrammer Aug 19, 2024
72d95e3
Remove unused typing.Union import
bosd Oct 24, 2024
98d8aff
Fix docstring in core.py
bosd Oct 25, 2024
8858a74
core.py Fix missing docstring
bosd Oct 25, 2024
83b157e
Ghostsciptbackend flake8 fixes
bosd Oct 26, 2024
e38474f
Surpress flake8 warnings on poppler backend
bosd Oct 26, 2024
20f578d
Activate pre-commit on gh actions
bosd Oct 17, 2024
ae891d8
boundaries to split lines Fix index out of range
bosd Oct 27, 2024
bd4a393
Update docstrings, add backands and fallback
bosd Oct 26, 2024
ee450ad
Bump crazy-max/ghaction-github-labeler from 5.0.0 to 5.1.0
dependabot[bot] Oct 28, 2024
715e061
Bump et-xmlfile from 1.1.0 to 2.0.0
dependabot[bot] Oct 28, 2024
11b3f93
Bump safety-schemas from 0.0.5 to 0.0.8
dependabot[bot] Oct 28, 2024
a3f119e
Bump virtualenv from 20.27.0 to 20.27.1 in /.github/workflows
dependabot[bot] Oct 29, 2024
1b6189c
[REF] generate_table_bbox -> imp efficiency & fix type error unhashab…
bosd Oct 27, 2024
88a3c3d
[REF] gen_table_bbox: stop processing if all textlines have been proc…
bosd Oct 28, 2024
9a1bfcf
Bump pypa/gh-action-pypi-publish from 1.10.3 to 1.11.0
dependabot[bot] Oct 31, 2024
8727534
Add infinity test
bosd Oct 30, 2024
eac06c7
[REF]: Fix Network parser: generate_table_bbox executing infinitly
bosd Oct 30, 2024
ad4a79c
[REF]: core set_border: Improve performance, Fix index out of Range
bosd Oct 30, 2024
a66923f
[FIX] compute_parse_error, Index out of range
bosd Oct 31, 2024
f120302
[REF] remove_unconnected_edges
bosd Oct 28, 2024
d4d5556
Add failling test hybrid parser --> Keyerror
bosd Oct 31, 2024
b644f71
[FIX] Network/Hybrid parer Keyerror
bosd Oct 31, 2024
c392c7f
Merge pull request #248 from py-pdf/dependabot/github_actions/pypa/gh…
dependabot[bot] Oct 31, 2024
526685a
Merge pull request #241 from py-pdf/dependabot/pip/dot-github/workflo…
dependabot[bot] Oct 31, 2024
02dcc8d
Bump pip from 24.2 to 24.3.1 in /.github/workflows
dependabot[bot] Oct 31, 2024
fe41058
Merge pull request #234 from py-pdf/dependabot/pip/dot-github/workflo…
dependabot[bot] Oct 31, 2024
313f75b
[REF] Compute_plausible_gaps, Efficiency, Stability
bosd Oct 28, 2024
ad1babd
[REF] Compute_plausible_gaps, Efficiency, Stability
bosd Oct 31, 2024
35d8d20
Merge pull request #235 from py-pdf/dependabot/github_actions/crazy-m…
dependabot[bot] Oct 31, 2024
d819ceb
Add pypdfium2 rendering backend
mara004 Jun 23, 2023
af73826
Capture possible error message
mara004 Sep 25, 2023
2f6801c
Add pypdfium2 to nox
mara004 Sep 25, 2023
a2ec714
Adjust testdata hybrid_vertical_headers for result with pdfium backend
bosd Oct 13, 2024
8c1ba69
[IMP] hybrid test hybrid_vertical_headers to use pdfium with no fallback
bosd Oct 14, 2024
2dd8e6c
[IMP] Add docstrings and typing to pdfium backend
bosd Oct 13, 2024
f967d7b
[IMP] Set pdfium as default parser
bosd Oct 13, 2024
f4c6db2
add pdfium backend to new naming import
bosd Oct 13, 2024
bb802b5
Add tests for pdfium backend
bosd Oct 14, 2024
51fde54
update pdfium backend
bosd Oct 14, 2024
3964837
Update lockfile
bosd Oct 26, 2024
0c576d6
Bump virtualenv from 20.27.0 to 20.27.1
dependabot[bot] Nov 1, 2024
4856018
Merge pull request #238 from py-pdf/dependabot/pip/safety-schemas-0.0.8
dependabot[bot] Nov 1, 2024
b758d03
Bump safety from 3.2.9 to 3.2.10
dependabot[bot] Nov 1, 2024
e355695
Merge pull request #240 from py-pdf/dependabot/pip/virtualenv-20.27.1
dependabot[bot] Nov 1, 2024
219dd04
Merge pull request #239 from py-pdf/dependabot/pip/safety-3.2.10
dependabot[bot] Nov 1, 2024
c2266cb
Merge pull request #236 from py-pdf/dependabot/pip/et-xmlfile-2.0.0
dependabot[bot] Nov 1, 2024
bf1725b
Bump typeguard from 4.3.0 to 4.4.0
dependabot[bot] Nov 1, 2024
6dad42e
Update pandas and lockfile to fix #250
bosd Nov 2, 2024
bba13c1
Merge pull request #237 from py-pdf/dependabot/pip/typeguard-4.4.0
dependabot[bot] Nov 2, 2024
58a8317
Update precommit config to python 3.8 plus
bosd Nov 2, 2024
d2b76a9
:student: Quickstart notebook (#231)
bosd Nov 2, 2024
4da9b38
[ADD] Hybrid Parser notebook
bosd Nov 2, 2024
d67bb91
:student: Fixup quick_start_notebook
bosd Nov 2, 2024
a14d8fc
Fixup Typo
bosd Nov 2, 2024
f21df42
Bump setuptools from 75.2.0 to 75.3.0
dependabot[bot] Nov 4, 2024
e8f5a0c
Bump rich from 13.9.3 to 13.9.4
dependabot[bot] Nov 4, 2024
3f92769
Merge pull request #265 from py-pdf/dependabot/pip/setuptools-75.3.0
dependabot[bot] Nov 4, 2024
00c21c9
Merge pull request #266 from py-pdf/dependabot/pip/rich-13.9.4
dependabot[bot] Nov 4, 2024
6d2e6f7
Update version once for bumpersion to work
snanda85 Nov 4, 2024
2729b1f
Moving to the next major release
snanda85 Nov 5, 2024
1f2cd39
[CLEAN] Remove unused function find_columns_boundaries
bosd Nov 2, 2024
54c7eb3
[CLEAN] Remove unused function record_metadata
bosd Nov 2, 2024
36d868e
[CLEAN] Remove python2 support leftover for passwords
bosd Nov 2, 2024
6aebaaa
Remove old comment
bosd Nov 2, 2024
668d3c2
[IMP] add typing to handlers, update docstings and pdfminer url
bosd Nov 2, 2024
c29875c
:student: add parser comparison notebook :sparkles:
bosd Nov 2, 2024
ed8df80
Bump pypa/gh-action-pypi-publish from 1.11.0 to 1.12.0
dependabot[bot] Nov 6, 2024
c491782
Merge pull request #268 from py-pdf/dependabot/github_actions/pypa/gh…
dependabot[bot] Nov 6, 2024
7659ef4
Bump pypa/gh-action-pypi-publish from 1.12.0 to 1.12.2
dependabot[bot] Nov 7, 2024
39b4f20
Merge pull request #269 from py-pdf/dependabot/github_actions/pypa/gh…
dependabot[bot] Nov 7, 2024
81187e7
Bump typer from 0.12.5 to 0.13.0
dependabot[bot] Nov 8, 2024
bac56a3
Merge pull request #270 from py-pdf/dependabot/pip/typer-0.13.0
dependabot[bot] Nov 8, 2024
da4a69c
[REM] Remove ghostscript from base dependencies
bosd Nov 2, 2024
401463b
[ADD] Add ghostscript to optional group
bosd Nov 2, 2024
868242d
Eliminated duplicate processes.
RyosukeSakaguchi May 25, 2022
714b58e
Reduce loops to improve process speed
takaaki-mizuno May 25, 2022
6d10782
Fixed import omissions.
RyosukeSakaguchi May 25, 2022
87d7e10
Fix get_and_text_objects
bosd Nov 2, 2024
3f27b37
[REM] obsolete function get_text_objects, Update imports
bosd Nov 2, 2024
06f48a1
[REF]: further reduce object loops
bosd Nov 9, 2024
b470fd1
Various documentation fixes (#227)
bosd Nov 9, 2024
7022a84
Reflect new namespace in documentation
bosd Oct 26, 2024
f67ff4d
Docs: Reflect pdfium as standard conversion backend and move ghostscr…
bosd Oct 26, 2024
45c2171
Update installation instructions
bosd Oct 27, 2024
71069a2
Update pdfminer url to new pdfminer.six
bosd Nov 9, 2024
7bb70a7
Handlers.py Fixup leftover typing
bosd Nov 10, 2024
a106761
Fixup leftover renaming in documentation (#273)
bosd Nov 10, 2024
45dbd01
[REM]: History file from previous repo/package
bosd Nov 10, 2024
284c134
Make plot dependencies optional
bosd Nov 10, 2024
e13a460
Merge branch 'main' into fix_version_for_cli
bosd Nov 10, 2024
a8fbef0
Merge branch 'snanda85-fix_version_for_cli'
bosd Nov 10, 2024
a25e001
Bump dparse from 0.6.4b0 to 0.6.4
dependabot[bot] Nov 11, 2024
684c432
Bump pydantic from 2.5.3 to 2.9.2
dependabot[bot] Nov 11, 2024
f450b24
Merge pull request #276 from py-pdf/dependabot/pip/dparse-0.6.4
dependabot[bot] Nov 11, 2024
6c0a7c9
Merge pull request #279 from py-pdf/dependabot/pip/pydantic-2.9.2
dependabot[bot] Nov 11, 2024
c2eafd9
Bump packaging from 24.1 to 24.2
dependabot[bot] Nov 11, 2024
4a884a3
Merge pull request #278 from py-pdf/dependabot/pip/packaging-24.2
dependabot[bot] Nov 11, 2024
41599ce
Bump tomli from 2.0.2 to 2.1.0
dependabot[bot] Nov 12, 2024
da57448
Merge pull request #280 from py-pdf/dependabot/pip/tomli-2.1.0
dependabot[bot] Nov 12, 2024
7ab083b
Bump safety from 3.2.10 to 3.2.11
dependabot[bot] Nov 13, 2024
3f3fca6
Merge pull request #281 from py-pdf/dependabot/pip/safety-3.2.11
dependabot[bot] Nov 13, 2024
b30154b
[FIX]: Compute_whitespace IndexError
bosd Nov 13, 2024
b4bfb93
[REF]: compute_whitespace add typing
bosd Nov 13, 2024
b39b2b3
Release 1.0.1
bosd Nov 13, 2024
dc45ff3
Bump fonttools from 4.54.1 to 4.55.0
dependabot[bot] Nov 15, 2024
71f029a
Merge pull request #285 from py-pdf/dependabot/pip/fonttools-4.55.0
dependabot[bot] Nov 15, 2024
2252707
Bump typer from 0.13.0 to 0.13.1
dependabot[bot] Nov 19, 2024
071afac
Merge pull request #287 from py-pdf/dependabot/pip/typer-0.13.1
dependabot[bot] Nov 19, 2024
eb012c8
Bump tornado from 6.4.1 to 6.4.2
dependabot[bot] Nov 26, 2024
325fbf1
Merge pull request #289 from py-pdf/dependabot/pip/tornado-6.4.2
dependabot[bot] Nov 26, 2024
6e06a07
Bump virtualenv from 20.27.1 to 20.28.0 in /.github/workflows
dependabot[bot] Nov 28, 2024
55173c8
Bump virtualenv from 20.27.1 to 20.28.0
dependabot[bot] Nov 28, 2024
e34110e
Bump tomli from 2.1.0 to 2.2.1
dependabot[bot] Nov 28, 2024
362869d
Merge pull request #291 from py-pdf/dependabot/pip/virtualenv-20.28.0
dependabot[bot] Nov 28, 2024
557eca3
Merge pull request #292 from py-pdf/dependabot/pip/tomli-2.2.1
dependabot[bot] Nov 28, 2024
afb60a5
Merge pull request #290 from py-pdf/dependabot/pip/dot-github/workflo…
dependabot[bot] Nov 28, 2024
3a80390
Bump matplotlib from 3.9.2 to 3.9.3 in /docs
dependabot[bot] Dec 2, 2024
38f5daa
Bump typer from 0.13.1 to 0.14.0
dependabot[bot] Dec 2, 2024
02535bd
Bump pytest from 8.3.3 to 8.3.4
dependabot[bot] Dec 2, 2024
77c7521
Merge pull request #293 from py-pdf/dependabot/pip/docs/matplotlib-3.9.3
dependabot[bot] Dec 2, 2024
8a3668f
Merge pull request #295 from py-pdf/dependabot/pip/pytest-8.3.4
dependabot[bot] Dec 2, 2024
96c4761
Merge pull request #294 from py-pdf/dependabot/pip/typer-0.14.0
dependabot[bot] Dec 2, 2024
fb00118
Bump safety-schemas from 0.0.9 to 0.0.10
dependabot[bot] Dec 3, 2024
5703b5f
Merge pull request #296 from py-pdf/dependabot/pip/safety-schemas-0.0.10
dependabot[bot] Dec 3, 2024
ae1420a
Bump fonttools from 4.55.0 to 4.55.1
dependabot[bot] Dec 4, 2024
aa333a6
Bump typer from 0.14.0 to 0.15.0
dependabot[bot] Dec 4, 2024
0b19c39
Merge pull request #298 from py-pdf/dependabot/pip/typer-0.15.0
dependabot[bot] Dec 4, 2024
4ea7e2b
Merge pull request #297 from py-pdf/dependabot/pip/fonttools-4.55.1
dependabot[bot] Dec 4, 2024
26f46a5
Bump typer from 0.15.0 to 0.15.1
dependabot[bot] Dec 6, 2024
0c74317
Bump fonttools from 4.55.1 to 4.55.2
dependabot[bot] Dec 6, 2024
fb290f1
Bump six from 1.16.0 to 1.17.0
dependabot[bot] Dec 6, 2024
c657813
Merge pull request #299 from py-pdf/dependabot/pip/typer-0.15.1
dependabot[bot] Dec 6, 2024
c379581
Merge pull request #300 from py-pdf/dependabot/pip/fonttools-4.55.2
dependabot[bot] Dec 6, 2024
f922005
Merge pull request #301 from py-pdf/dependabot/pip/six-1.17.0
dependabot[bot] Dec 6, 2024
d5cd041
Bump safety from 3.2.11 to 3.2.12
dependabot[bot] Dec 11, 2024
8cf118e
Bump fonttools from 4.55.2 to 4.55.3
dependabot[bot] Dec 11, 2024
784adb4
Merge pull request #303 from py-pdf/dependabot/pip/safety-3.2.12
dependabot[bot] Dec 11, 2024
88ca0cb
Merge pull request #304 from py-pdf/dependabot/pip/fonttools-4.55.3
dependabot[bot] Dec 11, 2024
23b3b3f
Bump safety from 3.2.12 to 3.2.13
dependabot[bot] Dec 12, 2024
bdf96e1
Merge pull request #305 from py-pdf/dependabot/pip/safety-3.2.13
dependabot[bot] Dec 12, 2024
616d9ee
Bump certifi from 2024.8.30 to 2024.12.14
dependabot[bot] Dec 16, 2024
a9ca6ef
Bump matplotlib from 3.9.3 to 3.10.0 in /docs
dependabot[bot] Dec 16, 2024
86b7cf2
Merge pull request #307 from py-pdf/dependabot/pip/docs/matplotlib-3.…
dependabot[bot] Dec 16, 2024
f81133a
Merge pull request #306 from py-pdf/dependabot/pip/certifi-2024.12.14
dependabot[bot] Dec 16, 2024
959e997
Bump attrs from 24.2.0 to 24.3.0
dependabot[bot] Dec 17, 2024
3000c8f
Merge pull request #308 from py-pdf/dependabot/pip/attrs-24.3.0
dependabot[bot] Dec 17, 2024
7cb8587
Bump pydata-sphinx-theme from 0.16.0 to 0.16.1 in /docs
dependabot[bot] Dec 18, 2024
abf0e84
Merge pull request #309 from py-pdf/dependabot/pip/docs/pydata-sphinx…
dependabot[bot] Dec 18, 2024
0d84aba
Bump pypdfium2 from 4.30.0 to 4.30.1
dependabot[bot] Dec 20, 2024
e4dcaec
Merge pull request #310 from py-pdf/dependabot/pip/pypdfium2-4.30.1
dependabot[bot] Dec 20, 2024
1b1899d
Bump livereload from 2.7.0 to 2.7.1
dependabot[bot] Dec 20, 2024
a7bf917
Merge pull request #311 from py-pdf/dependabot/pip/livereload-2.7.1
dependabot[bot] Dec 20, 2024
e868739
Bump mypy from 1.13.0 to 1.14.0
dependabot[bot] Dec 23, 2024
93a543b
Bump safety from 3.2.13 to 3.2.14
dependabot[bot] Dec 23, 2024
e0c4a69
Bump click from 8.1.7 to 8.1.8
dependabot[bot] Dec 23, 2024
cf8af5f
Merge pull request #315 from py-pdf/dependabot/pip/click-8.1.8
dependabot[bot] Dec 23, 2024
5bf0857
Merge pull request #314 from py-pdf/dependabot/pip/safety-3.2.14
dependabot[bot] Dec 23, 2024
26bb320
Merge pull request #313 from py-pdf/dependabot/pip/mypy-1.14.0
dependabot[bot] Dec 23, 2024
2239f30
Bump jinja2 from 3.1.4 to 3.1.5
dependabot[bot] Dec 23, 2024
fee2b2a
Merge pull request #316 from py-pdf/dependabot/pip/jinja2-3.1.5
dependabot[bot] Dec 23, 2024
fbe65c3
Bump charset-normalizer from 3.4.0 to 3.4.1
dependabot[bot] Dec 25, 2024
01cba95
Merge pull request #317 from py-pdf/dependabot/pip/charset-normalizer…
dependabot[bot] Dec 25, 2024
10 changes: 7 additions & 3 deletions .flake8
@@ -1,9 +1,13 @@
[flake8]
select = B,B9,C,D,DAR,E,F,N,RST,S,W
ignore = E203,E501,RST201,RST203,RST301,W503
ignore = D401,E203,E501,RST201,RST203,RST301,RST305,W503
max-line-length = 120
max-complexity = 10
docstring-convention = google
per-file-ignores = tests/*:S101
docstring-convention = numpy
per-file-ignores = tests/*:B950,D100,D102,D103,D104,D401,D101,D200,S101,S106,F403,F405,F841
pypdf_table_extraction/*:D100,D103,D104,F401,W0611,C0114 docs/*:D100,D101
__init__.py:D100,D103,D104,F401
__version__.py:D100
__main__.py:D100,D103
rst-roles = class,const,func,meth,mod,ref
rst-directives = deprecated
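
The `per-file-ignores` entries above pair a filename pattern with the codes to silence for matching files. As a rough illustration only (not flake8's actual implementation), the lookup can be sketched as glob matching against either the full path or the file's basename. The mapping below is a shortened, hypothetical copy of the new configuration, and `ignored_codes` is an illustrative helper name.

```python
from fnmatch import fnmatchcase

# Shortened, hypothetical copy of the new per-file-ignores mapping.
PER_FILE_IGNORES = {
    "tests/*": {"D100", "D103", "S101"},
    "__init__.py": {"D100", "D103", "D104", "F401"},
    "__version__.py": {"D100"},
}


def ignored_codes(path: str) -> set[str]:
    """Collect the codes of every pattern that matches the path or its basename."""
    basename = path.rsplit("/", 1)[-1]
    codes: set[str] = set()
    for pattern, pattern_codes in PER_FILE_IGNORES.items():
        if fnmatchcase(path, pattern) or fnmatchcase(basename, pattern):
            codes |= pattern_codes
    return codes


print(sorted(ignored_codes("tests/test_cli.py")))
print(sorted(ignored_codes("pypdf_table_extraction/__init__.py")))
```

Under this simplified model, a bare pattern such as `__init__.py` applies to that filename in any directory, which is presumably why the new config lists it separately from the package-wide `pypdf_table_extraction/*` entry.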
1 change: 0 additions & 1 deletion .github/FUNDING.yml

This file was deleted.

9 changes: 4 additions & 5 deletions .github/ISSUE_TEMPLATE/bug_report.md
@@ -6,15 +6,15 @@ labels: bug
assignees: ""
---

<!-- Please read the filing issues section of the contributor's guide first: https://camelot-py.readthedocs.io/en/master/dev/contributing.html -->
<!-- Please read the filing issues section of the contributor's guide first: https://pypdf-table-extraction.readthedocs.io/en/latest/dev/contributing.html#filing-issues -->

**Describe the bug**

<!-- A clear and concise description of what the bug is. -->

**Steps to reproduce the bug**

<!-- Steps used to install `camelot`:
<!-- Steps used to install `pypdf_table_extraction`:
1. Add step here (you can add more steps too) -->

<!-- Steps to be used to reproduce behavior:
@@ -26,10 +26,9 @@ assignees: ""

**Code**

<!-- Add the Camelot code snippet that you used. -->
<!-- Add the pypdf_table_extraction code snippet that you used. -->

```
import camelot

# add your code here
```
@@ -49,7 +48,7 @@ import camelot
- Numpy version:
- OpenCV version:
- Ghostscript version:
- Camelot version:
- pypdf_table_extraction version:

**Additional context**

6 changes: 3 additions & 3 deletions .github/workflows/constraints.txt
@@ -1,6 +1,6 @@
lockfile<0.13.0,>=0.12.2
pip==23.2.1
nox==2023.4.22
pip==24.3.1
nox==2024.10.9
nox-poetry==1.0.3
poetry>=1.2.0
virtualenv==20.24.5
virtualenv==20.28.0
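
To see at a glance which pins this hunk moves, the old and new constraint lines can be compared with a few lines of Python. This is a minimal sketch that only understands exact `name==version` pins; the helper names `parse_pins` and `changed_pins` are illustrative, and range constraints such as `poetry>=1.2.0` are deliberately skipped.

```python
# Exact pins taken from the old and new sides of the hunk above.
OLD = """\
pip==23.2.1
nox==2023.4.22
nox-poetry==1.0.3
virtualenv==20.24.5
"""
NEW = """\
pip==24.3.1
nox==2024.10.9
nox-poetry==1.0.3
virtualenv==20.28.0
"""


def parse_pins(text: str) -> dict[str, str]:
    """Map package name to version for every exact 'name==version' line."""
    pins: dict[str, str] = {}
    for line in text.splitlines():
        line = line.strip()
        if "==" in line and not line.startswith("#"):
            name, version = line.split("==", 1)
            pins[name.strip()] = version.strip()
    return pins


def changed_pins(old: str, new: str) -> dict[str, tuple]:
    """Return {name: (old_version, new_version)} for pins that differ."""
    old_pins, new_pins = parse_pins(old), parse_pins(new)
    return {
        name: (old_pins.get(name), version)
        for name, version in new_pins.items()
        if old_pins.get(name) != version
    }


for name, (before, after) in changed_pins(OLD, NEW).items():
    print(f"{name}: {before} -> {after}")
```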
3 changes: 1 addition & 2 deletions .github/workflows/labeler.yml
@@ -4,7 +4,6 @@ on:
push:
branches:
- main
- master

jobs:
labeler:
@@ -14,6 +13,6 @@ jobs:
uses: actions/checkout@v4

- name: Run Labeler
uses: crazy-max/ghaction-github-labeler@v5.0.0
uses: crazy-max/ghaction-github-labeler@v5.1.0
with:
skip-delete: true
23 changes: 7 additions & 16 deletions .github/workflows/release.yml
@@ -4,7 +4,6 @@ on:
push:
branches:
- main
- master

jobs:
release:
@@ -17,18 +16,18 @@ jobs:
fetch-depth: 2

- name: Set up Python
uses: actions/setup-python@v4
uses: actions/setup-python@v5
with:
python-version: "3.10"

- name: Upgrade pip
run: |
pip install --constraint=.github/workflows/constraints.txt pip
pip install --constraint=${PWD}/.github/workflows/constraints.txt pip
pip --version

- name: Install Poetry
run: |
pip install --constraint=.github/workflows/constraints.txt poetry
pip install --constraint=${PWD}/.github/workflows/constraints.txt poetry
poetry --version

- name: Check if there is a parent commit
@@ -39,7 +38,7 @@ jobs:
- name: Detect and tag new version
id: check-version
if: steps.check-parent-commit.outputs.sha
uses: salsify/action-detect-and-tag-new-version@v2.0.1
uses: salsify/action-detect-and-tag-new-version@v2.0.3
with:
version-command: |
bash -o pipefail -c "poetry version | awk '{ print \$2 }'"
@@ -57,21 +56,13 @@ jobs:

- name: Publish package on PyPI
if: steps.check-version.outputs.tag
uses: pypa/gh-action-pypi-publish@v1.8.10
uses: pypa/gh-action-pypi-publish@v1.12.2
with:
user: __token__
password: ${{ secrets.PYPI_TOKEN }}

- name: Publish package on TestPyPI
if: "! steps.check-version.outputs.tag"
uses: pypa/[email protected]
with:
user: __token__
password: ${{ secrets.TEST_PYPI_TOKEN }}
repository_url: https://test.pypi.org/legacy/
password: ${{ secrets.FLIT_PASSWORD }}

- name: Publish the release notes
uses: release-drafter/release-drafter@v5.24.0
uses: release-drafter/release-drafter@v6.0.0
with:
publish: ${{ steps.check-version.outputs.tag != '' }}
tag: ${{ steps.check-version.outputs.tag }}
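
The `version-command` in this workflow pipes `poetry version` through `awk '{ print $2 }'`, i.e. it keeps the second whitespace-separated field of output shaped like `<name> <version>`. A Python equivalent of that field extraction might look like the sketch below; `extract_version` is an illustrative helper, and the package name in the example string is an assumption.

```python
def extract_version(poetry_version_output: str) -> str:
    """Return the second whitespace-separated field, like awk '{ print $2 }'."""
    fields = poetry_version_output.split()
    if len(fields) < 2:
        raise ValueError(f"unexpected output: {poetry_version_output!r}")
    return fields[1]


# Assumed output shape of `poetry version`: "<package-name> <version>".
print(extract_version("pypdf-table-extraction 1.0.1\n"))
```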
68 changes: 47 additions & 21 deletions .github/workflows/tests.yml
@@ -1,8 +1,9 @@
name: Tests

on:
- push
- pull_request
push:
branches: [main]
pull_request:

jobs:
tests:
@@ -12,13 +13,20 @@ jobs:
fail-fast: false
matrix:
include:
# - { python: "3.10", os: "ubuntu-latest", session: "pre-commit" }
- { python: "3.10", os: "ubuntu-latest", session: "pre-commit" }
- { python: "3.10", os: "ubuntu-latest", session: "safety" }
# - { python: "3.11", os: "ubuntu-latest", session: "mypy" }
- {
python: "3.11",
os: "ubuntu-latest",
session: "mypy",
posargs:
["camelot", "tests", "docs/conf.py", "--allow-untyped-globals"],
}
# - { python: "3.10", os: "ubuntu-latest", session: "mypy" }
# - { python: "3.9", os: "ubuntu-latest", session: "mypy" }
# - { python: "3.8", os: "ubuntu-latest", session: "mypy" }
# - { python: "3.7", os: "ubuntu-latest", session: "mypy" }
- { python: "3.12", os: "ubuntu-latest", session: "tests" }
- { python: "3.11", os: "ubuntu-latest", session: "tests" }
- { python: "3.10", os: "ubuntu-latest", session: "tests" }
- { python: "3.9", os: "ubuntu-latest", session: "tests" }
@@ -39,13 +47,13 @@ jobs:
uses: actions/checkout@v4

- name: Set up Python ${{ matrix.python }}
uses: actions/setup-python@v4
uses: actions/setup-python@v5
with:
python-version: ${{ matrix.python }}

- name: Upgrade pip
run: |
pip install --constraint=.github/workflows/constraints.txt pip
pip install --constraint=${PWD}/.github/workflows/constraints.txt pip
pip --version

- name: Upgrade pip in virtual environments
@@ -59,13 +67,13 @@ jobs:

- name: Install Poetry
run: |
pipx install --pip-args=--constraint=.github/workflows/constraints.txt poetry
pipx install --pip-args=--constraint=${PWD}/.github/workflows/constraints.txt poetry
poetry --version

- name: Install Nox
run: |
pipx install --pip-args=--constraint=.github/workflows/constraints.txt nox
pipx inject --pip-args=--constraint=.github/workflows/constraints.txt nox nox-poetry
pipx install --pip-args=--constraint=${PWD}/.github/workflows/constraints.txt nox
pipx inject --pip-args=--constraint=${PWD}/.github/workflows/constraints.txt nox nox-poetry
nox --version

- name: Install ghostscript
@@ -74,6 +82,18 @@ jobs:
sudo apt update
sudo apt install ghostscript

- name: Install ghostscript (macos-latest)
if: matrix.os == 'macos-latest'
run: |
brew install ghostscript
mkdir -p ~/lib
ln -s "$(brew --prefix gs)/lib/libgs.dylib" ~/lib

- name: Install ghostscript (windows-latest)
if: matrix.os == 'windows-latest'
run: |
choco install --yes --no-progress --ignore-checksums ghostscript --version 9.56.1

- name: Compute pre-commit cache key
if: matrix.session == 'pre-commit'
id: pre-commit-cache
@@ -90,7 +110,7 @@ jobs:
print("::set-output name=result::{}".format(result))

- name: Restore pre-commit cache
uses: actions/cache@v3
uses: actions/cache@v4
if: matrix.session == 'pre-commit'
with:
path: ~/.cache/pre-commit
@@ -104,14 +124,16 @@ jobs:

- name: Upload coverage data
if: always() && matrix.session == 'tests'
uses: "actions/upload-artifact@v3"
uses: "actions/upload-artifact@v4"
with:
name: coverage-data
name: coverage-data-${{ matrix.session }}-${{ matrix.python }}-${{ matrix.os }}
path: ".coverage.*"
if-no-files-found: ignore
include-hidden-files: true

- name: Upload documentation
if: matrix.session == 'docs-build'
uses: actions/upload-artifact@v3
uses: actions/upload-artifact@v4
with:
name: docs
path: docs/_build
@@ -124,30 +146,31 @@ jobs:
uses: actions/checkout@v4

- name: Set up Python
uses: actions/setup-python@v4
uses: actions/setup-python@v5
with:
python-version: "3.10"

- name: Upgrade pip
run: |
pip install --constraint=.github/workflows/constraints.txt pip
pip install --constraint=${PWD}/.github/workflows/constraints.txt pip
pip --version

- name: Install Poetry
run: |
pipx install --pip-args=--constraint=.github/workflows/constraints.txt poetry
pipx install --pip-args=--constraint=${PWD}/.github/workflows/constraints.txt poetry
poetry --version

- name: Install Nox
run: |
pipx install --pip-args=--constraint=.github/workflows/constraints.txt nox
pipx inject --pip-args=--constraint=.github/workflows/constraints.txt nox nox-poetry
pipx install --pip-args=--constraint=${PWD}/.github/workflows/constraints.txt nox
pipx inject --pip-args=--constraint=${PWD}/.github/workflows/constraints.txt nox nox-poetry
nox --version

- name: Download coverage data
uses: actions/download-artifact@v3
uses: actions/download-artifact@v4
with:
name: coverage-data
pattern: coverage-data-*
merge-multiple: true

- name: Combine coverage data and display human readable report
run: |
@@ -158,4 +181,7 @@ jobs:
nox --session=coverage -- xml -i

- name: Upload coverage report
uses: codecov/[email protected]
uses: codecov/codecov-action@v4
env:
CODECOV_TOKEN: ${{ secrets.CODECOV_TOKEN }}
file: ./coverage.xml
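
The coverage steps in this diff switch from one shared artifact name (`coverage-data`) to one name per matrix entry, and the download step collects them again with `pattern: coverage-data-*` plus `merge-multiple: true`. The sketch below only models the naming and glob selection in plain Python to show why the names stay unique and still match the pattern; it does not call any GitHub API, and the matrix rows are a subset chosen for illustration.

```python
from fnmatch import fnmatchcase


def artifact_name(session: str, python: str, os_name: str) -> str:
    """Mirror the workflow template coverage-data-<session>-<python>-<os>."""
    return f"coverage-data-{session}-{python}-{os_name}"


def select_artifacts(names: list[str], pattern: str) -> list[str]:
    """Keep the artifact names that match a glob pattern."""
    return [name for name in names if fnmatchcase(name, pattern)]


# Illustrative subset of the test matrix.
matrix = [
    ("tests", "3.12", "ubuntu-latest"),
    ("tests", "3.11", "ubuntu-latest"),
    ("tests", "3.10", "ubuntu-latest"),
]
names = [artifact_name(*row) for row in matrix]

# Every matrix entry gets its own name, so parallel uploads do not collide.
print(len(set(names)) == len(names))
# The download pattern picks up the coverage artifacts but not the docs artifact.
print(select_artifacts(names + ["docs"], "coverage-data-*"))
```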
45 changes: 45 additions & 0 deletions .gitignore
@@ -7,5 +7,50 @@
/dist/
/docs/_build/
/src/*.egg-info/
/.pytest_cache
__pycache__/
/.ruff_cache
.idea/
*.py[cod]
*$py.class

# Distribution / packaging
.Python
build/
develop-eggs/
dist/
downloads/
eggs/
.eggs/
lib/
lib64/
parts/
sdist/
var/
wheels/
pip-wheel-metadata/
share/python-wheels/
*.egg-info/
.installed.cfg
*.egg
MANIFEST

# Sphinx documentation
docs/_build/


# vscode
.vscode
.env
changelog.txt

# Jupyter Notebook
.ipynb_checkpoints

# mypy
.mypy_cache/
.dmypy.json
dmypy.json

# OSX Files
.DS_Store
2 changes: 1 addition & 1 deletion .pre-commit-config.yaml
@@ -46,7 +46,7 @@ repos:
entry: pyupgrade
language: system
types: [python]
args: [--py37-plus]
args: [--py38-plus]
- id: trailing-whitespace
name: Trim Trailing Whitespace
entry: trailing-whitespace-fixer
4 changes: 2 additions & 2 deletions CODE_OF_CONDUCT.md
@@ -1,6 +1,6 @@
Be cordial or be on your way. --Kenneth Reitz

https://www.kennethreitz.org/essays/2013/01/27/be-cordial-or-be-on-your-way
https://kennethreitz.org/essays/2013/01/27/be-cordial-or-be-on-your-way

# Contributor Covenant Code of Conduct

@@ -64,7 +64,7 @@ representative at an online or offline event.

Instances of abusive, harassing, or otherwise unacceptable behavior may be
reported to the community leaders responsible for enforcement at
https://github.com/camelot-dev/camelot/issues.
https://github.com/py-pdf/pypdf_table_extraction/issues .
All complaints will be reviewed and investigated promptly and fairly.

All community leaders are obligated to respect the privacy and security of the