`EpochIterator`, `TensorFlowTrainer` : refactoring and fixes #20396

nicolaspi · 2024-10-22T18:17:56Z

Fixes #20394
Fixes #20390
Fixes #20344

TensorFlowTrainer :

Added usage of `iterator.get_next_as_optional()`, which enables `steps_per_execution` to exceed the previous limit of 32 while preventing `tf.errors.OutOfRangeError`.
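
For readers unfamiliar with the optional-based pattern, here is a minimal plain-Python sketch of the idea. It is not the code from this PR and does not use TensorFlow: `MaybeBatch`, `get_next_as_optional` (as a free function) and `run_steps` are hypothetical stand-ins. In TensorFlow the corresponding calls are `iterator.get_next_as_optional()`, `has_value()` and `get_value()`.

```python
class MaybeBatch:
    """Hypothetical stand-in for an optional value (not a TensorFlow class)."""

    def __init__(self, value=None, present=False):
        self._value = value
        self._present = present

    def has_value(self):
        return self._present

    def get_value(self):
        if not self._present:
            raise ValueError("no value present")
        return self._value


def get_next_as_optional(iterator):
    """Return a MaybeBatch instead of raising when the iterator is exhausted."""
    try:
        return MaybeBatch(next(iterator), present=True)
    except StopIteration:
        return MaybeBatch()


def run_steps(iterator, steps_per_execution, step_fn):
    """Run up to `steps_per_execution` steps, stopping early if data runs out."""
    outputs = []
    for _ in range(steps_per_execution):
        batch = get_next_as_optional(iterator)
        if not batch.has_value():
            break  # data ended mid-execution; no exception escapes the loop
        outputs.append(step_fn(batch.get_value()))
    return outputs


# Only 5 batches are available, but 64 steps are requested in one execution.
results = run_steps(iter(range(5)), steps_per_execution=64, step_fn=lambda x: x * 2)
print(results)
```

Because exhaustion is reported as an empty optional rather than an exception, the loop can ask for more steps than the data provides and simply stop when the data ends.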

EpochIterator :

Refactored to unify the code base across different backends.
Fixed edge cases involving various combinations of `steps_per_epoch` and `steps_per_execution`.
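
To illustrate the kind of edge case involved, here is a small sketch of how one epoch might be split into executions when the step count is not a multiple of `steps_per_execution`. `plan_executions` is a hypothetical helper, not part of Keras, and `total_steps` stands for either `steps_per_epoch` or the number of batches in the data.

```python
def plan_executions(total_steps, steps_per_execution):
    """Return (first_step, step_count) pairs covering one epoch.

    Hypothetical helper for illustration only; not the Keras implementation.
    """
    if steps_per_execution < 1:
        raise ValueError("steps_per_execution must be >= 1")
    plan = []
    for first_step in range(0, total_steps, steps_per_execution):
        # The last execution may be shorter than steps_per_execution.
        step_count = min(steps_per_execution, total_steps - first_step)
        plan.append((first_step, step_count))
    return plan


print(plan_executions(10, 4))   # uneven split: last execution is shorter
print(plan_executions(3, 32))   # steps_per_execution larger than the epoch
```

The two cases worth checking are an epoch length that is not divisible by `steps_per_execution` and a `steps_per_execution` larger than the whole epoch.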

Tensorflow trainer's train and test functions refactoring Fix keras-team#20394 Fix keras-team#20390 Fix keras-team#20344

fchollet

This is neat -- thanks for the PR.

fchollet · 2024-10-22T19:14:05Z

keras/src/trainers/epoch_iterator.py

    @property
    def num_batches(self):
        if self.steps_per_epoch:
            return self.steps_per_epoch
        # Either copied from the data_adapter, or
        # inferred at the end of an iteration.
        return self._num_batches
+
+
+class EpochIterator(_EpochIterator):


Couldn't the two classes be merged into one?

Yes, I merged them.

fchollet · 2024-10-22T19:14:29Z

keras/src/trainers/epoch_iterator.py

+class EpochIterator(_EpochIterator):
+    def __next__(self):
+        buffer = []
+        step, iterator = super().__next__()


Just need to call next(self._epoch_iterator)

fchollet · 2024-10-22T19:16:46Z

keras/src/trainers/epoch_iterator.py

@@ -75,55 +75,86 @@ def __init__(
    def _get_iterator(self):
        return self.data_adapter.get_numpy_iterator()

-    def enumerate_epoch(self):


This method must be kept for backwards compatibility (for users who have a custom fit())
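
One way to satisfy both review comments is to keep `enumerate_epoch` as a generator and have `__next__` delegate to it. The sketch below is only an assumption about the shape of such a class: `SketchEpochIterator` and its constructor arguments are hypothetical, and only `enumerate_epoch` and `self._epoch_iterator` are names taken from this thread.

```python
class SketchEpochIterator:
    """Hypothetical class for illustration; not the Keras implementation."""

    def __init__(self, batches, steps_per_execution=1):
        self._batches = list(batches)
        self.steps_per_execution = steps_per_execution
        self._epoch_iterator = None

    def enumerate_epoch(self):
        """Generator API kept for callers with a custom fit() loop."""
        for step in range(0, len(self._batches), self.steps_per_execution):
            yield step, self._batches[step : step + self.steps_per_execution]

    def __iter__(self):
        # Start a fresh pass over the epoch each time iteration begins.
        self._epoch_iterator = self.enumerate_epoch()
        return self

    def __next__(self):
        # Delegate to the generator; StopIteration propagates unchanged.
        return next(self._epoch_iterator)


it = SketchEpochIterator("abcde", steps_per_execution=2)
print(list(it))                    # iterator protocol
print(list(it.enumerate_epoch()))  # legacy generator call, same result
```

Both entry points produce the same `(step, batches)` pairs, so existing callers of `enumerate_epoch` keep working while new code can iterate the object directly.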

fchollet · 2024-10-22T19:19:44Z

It's fairly confusing to have backend-specific iterators subclass _EpochIterator -- much better to have a single EpochIterator class and have backend-specific classes override what they need to change.

Restored `enumerate_epoch`

nicolaspi · 2024-10-22T19:37:53Z

> It's fairly confusing to have backend-specific iterators subclass _EpochIterator -- much better to have a single EpochIterator class and have backend-specific classes override what they need to change.

I removed _EpochIterator.
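
The structure suggested in the review can be sketched as a single base class whose subclasses override only the hook they need. The class names below are hypothetical and the "backend" is a toy; only `_get_iterator` and `enumerate_epoch` are names that appear in the diff above.

```python
class BaseEpochIterator:
    """Hypothetical base class holding the shared logic."""

    def __init__(self, data):
        self.data = data

    def _get_iterator(self):
        # Default behaviour: iterate the data as-is.
        return iter(self.data)

    def enumerate_epoch(self):
        # Shared logic lives here and is not duplicated per backend.
        for step, batch in enumerate(self._get_iterator()):
            yield step, batch


class UppercaseBackendIterator(BaseEpochIterator):
    """Toy 'backend' that changes only how batches are produced."""

    def _get_iterator(self):
        return (item.upper() for item in self.data)


print(list(BaseEpochIterator(["a", "b"]).enumerate_epoch()))
print(list(UppercaseBackendIterator(["a", "b"]).enumerate_epoch()))
```

With this layout there is no private intermediate class to subclass: backend-specific behaviour is confined to the overridden method.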

fchollet

LGTM, thank you for the contribution!

nicolaspi added 4 commits October 22, 2024 16:58

EpochIterator refactoring

5365df8

Tensorflow trainer's train and test functions refactoring Fix keras-team#20394 Fix keras-team#20390 Fix keras-team#20344

CI test

c93da39

CI fix

c05177c

revert CI conf

10140be

google-ml-butler bot added the size:L label Oct 22, 2024

google-ml-butler bot assigned gbaned Oct 22, 2024

fchollet reviewed Oct 22, 2024

Removed _EpochIterator

d5db3b4

Restored `enumerate_epoch`

nicolaspi requested a review from fchollet October 22, 2024 19:48

google-ml-butler bot added the awaiting review label Oct 22, 2024

Fix enumerate_epoch

75daac6

fchollet added the kokoro:force-run label Oct 22, 2024

kokoro-team removed the kokoro:force-run label Oct 22, 2024

fchollet approved these changes Oct 22, 2024

google-ml-butler bot added the kokoro:force-run and ready to pull labels Oct 22, 2024

fchollet merged commit 6b662d1 into keras-team:master Oct 22, 2024
9 checks passed

google-ml-butler bot removed the awaiting review, ready to pull, and kokoro:force-run labels Oct 22, 2024
