Introduce POT for energy files #396

Sarkosos · 2025-03-10T11:43:41Z

No description provided.

… data

- Implement validation of intermediate checkpoints during MD simulation
- Add new methods to get and validate intermediate checkpoints
- Update constants with checkpoint validation parameters
- Modify validator functions to support intermediate checkpoint validation
- Enhance error handling and logging for checkpoint validation process

…king

- Implement intermediate checkpoint submission method in FoldingMiner
- Add SequentialCheckpointReporter to track checkpoints across simulation states
- Update SimulationManager to calculate sequential checkpoint numbering
- Enable retrieval and submission of specific checkpoint files

- Replace MDOutput Pydantic model with a standard dictionary
- Update JobSubmissionSynapse to work with plain dictionary instead of Pydantic model
- Modify deserialization logic to handle dictionary-based md_output
- Remove unnecessary Pydantic import

mccrindlebrian · 2025-03-11T19:06:03Z

folding/protocol.py

@@ -78,12 +76,17 @@ def deserialize(self) -> int:
            self.md_output = {}
        else:
            md_output = {}
-            for k, v in self.md_output.items():
-                try:
-                    md_output[k] = base64.b64decode(v)


Could you just have done md_output[k] = base64.b64decode(v) if v is not None else None?

Maybe. I just copied what we had for the JobSubmissionSynapse.
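The one-line form suggested in the review can be sketched as a small helper. This is a minimal illustration, not the code in folding/protocol.py: the function name decode_md_output and the sample file names are hypothetical, and it assumes md_output maps file names to base64 strings or None.

```python
import base64
from typing import Optional


def decode_md_output(md_output: dict[str, Optional[str]]) -> dict[str, Optional[bytes]]:
    """Base64-decode every value, passing None through unchanged."""
    return {
        key: base64.b64decode(value) if value is not None else None
        for key, value in md_output.items()
    }


# Hypothetical payload: one encoded file and one missing file.
encoded = {
    "state.cpt": base64.b64encode(b"checkpoint-bytes").decode("ascii"),
    "missing.log": None,
}
decoded = decode_md_output(encoded)
```

Note that this form only guards against None. Unlike a try/except around the decode, it still raises on incorrectly padded input, so which variant is preferable depends on whether malformed data should be skipped or surfaced.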

mccrindlebrian · 2025-03-11T19:13:11Z

folding/registries/evaluation_registry.py

+        responses = validator.dendrite.query(
+            synapse=synapse, axons=[axon], deserialize=True
+        )
+        return responses[0].cpt_files


Based on the code, it looks like responses[0].cpt_files is a tuple. Is that true? If so, I think it would be clearer to return each value individually.

It's a dictionary.

mccrindlebrian · 2025-03-11T19:16:49Z

folding/registries/evaluation_registry.py

    def name(self) -> str:
        return "SyntheticMD"

+    def validate_intermediate_checkpoints(self, validator, job_id, axon):


So much of this class is the same as the regular pipeline we have. Can we abstract the pipeline so that we don't have to iterate within it, and instead give the pipeline a checkpoint file and whatever else it needs to do the validation? Then we would iterate at the higher level and just call the pipeline. This was my original idea when creating the registry, so you can just call:

for data in X: evaluation_registry[task].validate(**data)
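The pattern described in this comment can be sketched as follows. Everything here is hypothetical and only illustrates the shape of the idea: the class CheckpointEvaluator, its energy_tolerance parameter, and the keyword arguments of validate are assumptions, not the repository's actual evaluator interface.

```python
class CheckpointEvaluator:
    """Hypothetical evaluator that validates exactly one checkpoint per call."""

    def __init__(self, energy_tolerance: float) -> None:
        self.energy_tolerance = energy_tolerance

    def validate(self, checkpoint_energy: float, reference_energy: float) -> bool:
        # A checkpoint passes if its energy is within tolerance of the reference.
        return abs(checkpoint_energy - reference_energy) <= self.energy_tolerance


# Hypothetical registry keyed by task name, as in the review comment.
evaluation_registry = {"SyntheticMD": CheckpointEvaluator(energy_tolerance=0.5)}


def validate_all(task: str, items: list[dict]) -> list[bool]:
    """Iterate at the higher level; the evaluator never loops over checkpoints."""
    return [evaluation_registry[task].validate(**data) for data in items]


results = validate_all(
    "SyntheticMD",
    [
        {"checkpoint_energy": -100.2, "reference_energy": -100.0},
        {"checkpoint_energy": -90.0, "reference_energy": -100.0},
    ],
)
```

Keeping the loop outside the evaluator means the same validate call can serve both final and intermediate checkpoints, which is the duplication the comment is pointing at.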

- Improve file attachment logic in attach_files_to_synapse
- Add robust log file handling with header validation and combination
- Update checkpoint saving in SimulationManager
- Fix checkpoint reporter filename generation
- Add more comprehensive error handling and logging

- Add traceback logging for intermediate checkpoint validation errors
- Add logging for participation and step execution in validators
- Update checkpoint selection logic to use the last step number
- Add axons parameter to get_energies function

…/folding into features/energy-POT

Adjust the random checkpoint selection range to prevent potential index out of bounds error by subtracting 1 from the maximum checkpoint number
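One common way this kind of off-by-one arises is that random.randint includes both endpoints. The sketch below assumes zero-based checkpoint indices, which the commit message does not state; the function name pick_checkpoint_index is hypothetical.

```python
import random


def pick_checkpoint_index(num_checkpoints: int, rng: random.Random) -> int:
    """Pick a random valid index into a list of num_checkpoints checkpoints."""
    if num_checkpoints < 1:
        raise ValueError("at least one checkpoint is required")
    # randint is inclusive on both ends, so the upper bound must be
    # num_checkpoints - 1 to stay inside the list.
    return rng.randint(0, num_checkpoints - 1)


rng = random.Random(0)  # fixed seed only to make the sketch repeatable
picks = [pick_checkpoint_index(5, rng) for _ in range(1000)]
```

Using rng.randrange(num_checkpoints) would express the same range without the manual subtraction.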

…e submission

- Refactor log file combination using pandas for more robust processing
- Add sorting of log files by step number
- Simplify file attachment logic
- Add logging for intermediate checkpoint submission
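The idea of combining log files in step order, with a header check, can be sketched as below. The commit uses pandas; this sketch deliberately swaps in the standard library csv module to stay dependency-free. The file naming scheme (a step number embedded in the name), the CSV layout, and the function names are all assumptions for illustration.

```python
import csv
import io
import re


def step_number(filename: str) -> int:
    """Extract the first integer in a file name, e.g. 'log_10.csv' -> 10."""
    match = re.search(r"(\d+)", filename)
    if match is None:
        raise ValueError(f"no step number in {filename!r}")
    return int(match.group(1))


def combine_logs(logs: dict[str, str]) -> list[dict[str, str]]:
    """Concatenate CSV logs in numeric step order, rejecting mismatched headers."""
    rows: list[dict[str, str]] = []
    expected_header = None
    # Sort numerically: a plain string sort would put 'log_10' before 'log_2'.
    for name in sorted(logs, key=step_number):
        reader = csv.DictReader(io.StringIO(logs[name]))
        if expected_header is None:
            expected_header = reader.fieldnames
        elif reader.fieldnames != expected_header:
            raise ValueError(f"header mismatch in {name!r}")
        rows.extend(reader)
    return rows


# Hypothetical log contents keyed by file name.
logs = {
    "log_10.csv": "step,energy\n10,-5.0\n20,-6.0\n",
    "log_2.csv": "step,energy\n1,-1.0\n2,-2.0\n",
}
combined = combine_logs(logs)
```

With pandas, the same ordering step would typically feed a list of DataFrames into pandas.concat.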

…/folding into features/energy-POT

- Convert validation methods in SyntheticMDEvaluator to async
- Update method calls to use await for async operations
- Modify checkpoint validation to check for exact number of checkpoints
- Update error message for insufficient intermediate checkpoints
- Adjust forward method calls to use async syntax in validators

… FoldingMiner

- Extend BaseMinerNeuron with a new intermediate_submission_forward method
- Update FoldingMiner to base64 encode checkpoint files for submission
- Modify checkpoint file naming to use sequential numbering
- Add cleanup for temporary combined log file after processing

…/folding into features/energy-POT

- Move the check for running simulations to a later point in the method
- Ensure the event is updated correctly before returning simulation status
- Maintain functionality for checking local storage for simulation data

…idation

- Added logging for validity checks on final and intermediate checkpoints.
- Updated checkpoint validation to include specific checkpoint numbers.
- Refactored energy extraction logic to handle final and intermediate checkpoints separately.
- Improved event dictionary structure to accommodate new energy data formats.

- Update the energy retrieval logic to use final miner energies instead of intermediate steps.
- Remove redundant code related to max_step calculations for improved clarity.

- Remove the creation of the state file during initialization and instead generate it dynamically based on the checkpoint number.
- Update the save and load state methods to use the newly constructed state file path.
- Clean up redundant comments related to state file management.

- Update the log file naming convention to use the checkpoint number instead of the current state for improved clarity and consistency.
- Remove the unused max_step variable to streamline the energy extraction logic.

…factor Refactor of the evaluator

Sarkosos added 6 commits March 8, 2025 14:09

Refactor protocol and validator to use Pydantic models for structured…

445f2a3

… data

Add miner-data to .gitignore

02e0ad0

Remove circular imports

5216f2b

mccrindlebrian reviewed Mar 11, 2025

folding/protocol.py (resolved)

mccrindlebrian reviewed Mar 11, 2025

folding/registries/evaluation_registry.py (outdated, resolved)

mccrindlebrian reviewed Mar 11, 2025

folding/registries/evaluation_registry.py (outdated, resolved)

mccrindlebrian reviewed Mar 11, 2025

Sarkosos added 12 commits March 12, 2025 09:51

Merge branch 'features/energy-POT' of https://github.com/macrocosm-os…

1a66a32

…/folding into features/energy-POT

Fix checkpoint selection range in SyntheticMDEvaluator

b76651b

Merge branch 'features/energy-POT' of https://github.com/macrocosm-os…

af9e04b

…/folding into features/energy-POT

Merge branch 'features/energy-POT' of https://github.com/macrocosm-os…

1d6b62f

…/folding into features/energy-POT

Refactor simulation check logic in FoldingMiner

7bf26da

updating energy extraction logic

91cafaf

Refactor evaluator to be able to validate on any given checkpoint

5e88b06

Sarkosos marked this pull request as ready for review March 17, 2025 10:46

Sarkosos added 5 commits March 17, 2025 14:23

make log_step a class attribute

128edce

Refactor energy extraction in SyntheticMDEvaluator

a817513

Refactor log file naming in SyntheticMDEvaluator

eadec64

Sarkosos and others added 2 commits March 19, 2025 11:59

removed pre loading of the pdb to avoid reproducibility issues

7e5ed81

Merge pull request #401 from macrocosm-os/features/energy-POT-eval-re…

ffb2d18

…factor Refactor of the evaluator

Sarkosos commented Mar 10, 2025

mccrindlebrian Mar 11, 2025

Sarkosos Mar 12, 2025

mccrindlebrian Mar 11, 2025

Sarkosos Mar 12, 2025

mccrindlebrian Mar 11, 2025
