Major environment refactoring (base version) #169
Conversation
🚀
demand_distribution: Union[
    int, float, type, Callable
] = Uniform,
vehicle_capacity: float = 1.0,
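A parameter typed `Union[int, float, type, Callable]` like `demand_distribution` above typically gets dispatched on its runtime type. The following is only an illustrative sketch of that pattern; `make_sampler` is a hypothetical helper, not RL4CO's actual implementation.

```python
# Sketch of dispatching a Union[int, float, type, Callable] parameter.
# `make_sampler` is a hypothetical name, not the actual RL4CO API.
import random
from typing import Callable, Union


def make_sampler(dist: Union[int, float, type, Callable]) -> Callable[[], float]:
    # A plain number is interpreted as a constant value for every node.
    if isinstance(dist, (int, float)):
        return lambda: float(dist)
    # A class or function is treated as a sampler and called directly.
    if callable(dist):
        return dist
    raise TypeError(f"Unsupported distribution: {dist!r}")


assert make_sampler(2)() == 2.0
draw = make_sampler(lambda: random.uniform(0.0, 1.0))()
assert 0.0 <= draw <= 1.0
```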
I think vehicle_capacity and capacity should be the same parameter, right?
EDIT: my bad, capacity is actually the "maximum capacity" used for normalization.
Yes, maybe later we will want to rename capacity. I think it may be confusing to some users for now.
Ok, let's leave it as simple as possible for now
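The distinction settled in this thread can be sketched as follows. The values and helper names are purely illustrative assumptions, not RL4CO's actual code: `capacity` acts as the raw "maximum capacity" used to normalize demands, while `vehicle_capacity` is the capacity the environment sees after normalization.

```python
# Illustrative sketch (hypothetical values, not RL4CO's implementation):
# `capacity` is the raw maximum used as a normalization constant,
# `vehicle_capacity` is the normalized capacity the environment works with.
import random

num_loc = 20
capacity = 30.0          # raw "maximum capacity" (normalization constant)
vehicle_capacity = 1.0   # normalized capacity seen by the environment

raw_demands = [random.randint(1, 9) for _ in range(num_loc)]
demands = [d / capacity for d in raw_demands]

# After normalization, every demand is a fraction of the vehicle capacity.
assert all(0.0 < d <= vehicle_capacity for d in demands)
```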
log = get_pylogger(__name__)
class MPDPEnv(RL4COEnvBase):
-    """Multi-agent Pickup and Delivery Problem environment.
+    """Multi-agent Pickup and Delivery Problem (mPDP) environment.
Note that this should actually be called "mPDTSP", but I guess we can leave it like this for now.
self.depot_sampler = get_sampler("depot", depot_distribution, min_loc, max_loc, **kwargs)

# Prize distribution
self.deterministic_prize_sampler = get_sampler("deterministric_prize", "uniform", 0.0, 4.0/self.num_loc, **kwargs)
This one cannot be changed via kwargs, right?
Yes, since there is a specific rule for the value range of deterministic_prize, and also for the following stochastic_prize. Here I hardcode the distribution to Uniform.
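The hardcoded rule described above can be sketched like this: the deterministic prize is drawn uniformly from [0, 4/num_loc], so its range is tied to the instance size and cannot be overridden via kwargs. `sample_deterministic_prize` is a hypothetical helper for illustration, not RL4CO's actual API.

```python
# Hypothetical sketch of the hardcoded deterministic-prize rule:
# values are drawn uniformly from [0, 4 / num_loc], so the upper bound
# depends on the number of locations. Not the actual RL4CO implementation.
import random


def sample_deterministic_prize(num_loc: int) -> list:
    high = 4.0 / num_loc  # fixed upper bound tied to num_loc
    return [random.uniform(0.0, high) for _ in range(num_loc)]


prizes = sample_deterministic_prize(50)
assert len(prizes) == 50
assert all(0.0 <= p <= 4.0 / 50 for p in prizes)
```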
Description
Please refer to the full environment refactor PR: #166.
Motivation and Context
Please refer to the full environment refactor PR: #166.
Types of changes
Compared with the full environment refactor, this base refactor only touches:
And as a base refactor, this PR keeps:
Overall, this base version PR only moves code around and, as a consequence, does not change any logic. In future versions, we will refactor the environments step by step, guided by the full refactor version.
For more details, please refer to the full environment refactor PR: #166.
Checklist