[324] Fix rev dep #325

zachmayer · 2024-08-14T17:09:13Z

fix for prediction issue with old, saved earth/bam models in caret, where they return arrays or matricies
fix for predictions with new and old neuralnet and klar models that need a matrix as input - this adds a few new dependencies to suggests as they're needed for this test
add a release only check that we can fit and predict a large number different caret models
add seed to readme and re add readme pr check

Fix for #324

https://github.com/zachmayer/caretEnsemble/actions/runs/10276799499/job/28438114871#step:4:46

The new caretEnsemble package breaks some old, saved caret::train models that are earth::earth models.

This PR adds a serialized version of such a model as well as a backwards compatibility test to make sure it works. I also added a line of code to fix the test.

There's also a fix for some packages that require matricies at prediction time.

Also add a test as part of the release process that we can do a caretList and predict for basically all models in caret.

coderabbitai · 2024-08-14T17:09:30Z

Important

Review skipped

Auto incremental reviews are disabled on this repository.

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

Walkthrough

This update introduces a comprehensive suite of changes to enhance the functionality and reliability of an R package. Key improvements include the addition of automated testing workflows via GitHub Actions, refinements in package dependencies, updates to model evaluation metrics, and enhancements to prediction logic. Furthermore, new tests and scripts for backward compatibility and model validation have been established, ensuring that the package maintains its integrity across updates.

Changes

Files	Change Summary
`.github/workflows/readme.yaml`	Added a GitHub Actions workflow to automate testing, documentation updates, and validation of the `README.md` file.
`DESCRIPTION`	Updated suggested packages to include `earth`, `klaR`, `mgcv`, and `pkgdown`; removed `usethis`.
`Makefile`	Introduced a new target `check-many-preds` for model prediction checks and updated the `clean` target to remove `.Rout` files.
`R/caretPredict.R`	Enhanced prediction logic for models; added checks for output validation and backward compatibility.
`README.md`	Updated model evaluation metrics, including RMSE values for various models reflecting improved predictive accuracy.
`README.rmd`	Added `set.seed(42L)` for reproducible results in random sampling examples.
`inst/WORDLIST`	Added back the words "setosa" and "yhat" to the word list.
`inst/data-raw/build_backwards_compatability_data.R`	Introduced a script to isolate and test backward compatibility issues related to `caretEnsemble` models.
`inst/data-raw/test-all_models.R`	New script for comprehensive testing of various machine learning models, ensuring they produce valid predictions.
`tests/testthat/test-backwards-compatability.R`	Created tests to ensure older models saved in `caretStack` format remain functional.
`tests/testthat/test-caretList.R`	Added tests to validate `caretList` functionality with models returning arrays/matrices, ensuring correct output structure.

Sequence Diagram(s)

sequenceDiagram
    participant User
    participant GitHub Actions
    participant R Environment
    participant Package
    
    User->>GitHub Actions: Push changes
    GitHub Actions->>R Environment: Run tests and build
    R Environment->>Package: Execute `make readme`
    Package->>R Environment: Validate README changes
    R Environment-->>GitHub Actions: Report results
    GitHub Actions-->>User: Notify results

Thank you for using CodeRabbit. We offer it for free to the OSS community and would appreciate your support in helping us grow. If you find it useful, would you consider giving us a shout-out on your favorite social media?

Share

Tips

Chat

There are 3 ways to chat with CodeRabbit:

‼️ IMPORTANT
Auto-reply has been disabled for this repository in the CodeRabbit settings. The CodeRabbit bot will not respond to your replies unless it is explicitly tagged.

Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query. Examples:
- @coderabbitai generate unit testing code for this file.
- @coderabbitai modularize this function.
PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
- @coderabbitai generate interesting stats about this repository and render them as a table.
- @coderabbitai show all the console.log statements in this repository.
- @coderabbitai read src/utils.ts and generate unit testing code.
- @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.
- @coderabbitai help me debug CodeRabbit configuration file.

Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments.

CodeRabbit Commands (invoked as PR comments)

@coderabbitai pause to pause the reviews on a PR.
@coderabbitai resume to resume the paused reviews.
@coderabbitai review to trigger an incremental review. This is useful when automatic reviews are disabled for the repository.
@coderabbitai full review to do a full review from scratch and review all the files again.
@coderabbitai summary to regenerate the summary of the PR.
@coderabbitai resolve resolve all the CodeRabbit review comments.
@coderabbitai configuration to show the current CodeRabbit configuration for the repository.
@coderabbitai help to get help.

Additionally, you can add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.

CodeRabbit Configuration File (`.coderabbit.yaml`)

You can programmatically configure CodeRabbit by adding a .coderabbit.yaml file to the root of your repository.
Please see the configuration documentation for more information.
If your editor has YAML language server enabled, you can add the path at the top of this file to enable auto-completion and validation: # yaml-language-server: $schema=https://coderabbit.ai/integrations/schema.v2.json

Documentation and Community

Visit our Documentation for detailed information on how to use CodeRabbit.
Join our Discord Community to get help, request features, and share feedback.
Follow us on X/Twitter for updates and announcements.

coderabbitai

Actionable comments posted: 0

Outside diff range, codebase verification and nitpick comments (4)

inst/data-raw/test-all_models.R (2)
25-40: Consider documenting the exclusion criteria for java_models.

While the exclusion of certain models is clear, adding a comment to explain why these specific models are excluded could improve the maintainability and clarity of the code.
+  # Excluding models that require Java or are otherwise problematic
  java_models <- c(
    "gbm_h2o",
    "glmnet_h2o",
    ...
  )
84-106: Thorough testing of model predictions.

The test block effectively checks both the ability to predict and the handling of stacked predictions. The use of expect_identical and expect_true ensures that predictions are as expected. However, consider adding more detailed comments to explain the purpose of each test step, especially regarding the handling of models that predict Inf values.
+  # Ensure predictions are made for all models and check for finite values
  testthat::expect_identical(nrow(pred), 5L)
  testthat::expect_identical(ncol(pred), length(all_models))
  testthat::expect_true(all(unlist(lapply(pred, is.finite))))
inst/data-raw/build_backwards_compatability_data.R (2)
1-9: Clarify the purpose and usage of the script.

The initial comments provide context about the script's purpose but could benefit from more structured documentation. Consider adding a brief summary of the script's functionality and intended usage.
+  # This script isolates and prunes a problematic model for testing backward compatibility.
  # This script is a little big sorry.
  # We're using a 3rd party dataset from a package
  ...
53-83: Iterative pruning logic is well-implemented.

The prune_list_iterative function systematically removes elements from the model object. The logic for determining whether to keep or remove elements is sound. However, consider adding comments to explain the decision-making process in more detail.
+  # If error changes or warnings appear, retain the element; otherwise, remove it
  if ((!is.null(result$error) && result$error != "is.vector(pred) is not TRUE") || !is.null(result$wrns)) {
    ...
  }

Review details

Configuration used: .coderabbit.yaml
Review profile: CHILL

Commits

Files that changed from the base of the PR and between f8df26b and d73acd5.

Files ignored due to path filters (2)

man/figures/README-greedy-stack-6-plot-1.png is excluded by !**/*.png
man/figures/README-unnamed-chunk-5-1.png is excluded by !**/*.png

Files selected for processing (11)

.github/workflows/readme.yaml (1 hunks)
DESCRIPTION (1 hunks)
Makefile (4 hunks)
R/caretPredict.R (1 hunks)
README.md (5 hunks)
README.rmd (1 hunks)
inst/WORDLIST (2 hunks)
inst/data-raw/build_backwards_compatability_data.R (1 hunks)
inst/data-raw/test-all_models.R (1 hunks)
tests/testthat/test-backwards-compatability.R (1 hunks)
tests/testthat/test-caretList.R (1 hunks)

Files skipped from review due to trivial changes (1)

inst/WORDLIST

Additional comments not posted (28)

.github/workflows/readme.yaml (4)

1-5: Triggers are correctly set.

The workflow triggers on push and pull request events for the main and master branches, which is appropriate for CI workflows.

7-14: Job setup follows best practices.

The job is named Tests, runs on ubuntu-latest, and includes steps for checking out the repository and setting up R, which are standard practices.

14-19: R environment setup is correctly implemented.

The workflow uses r-lib/actions/setup-r to set up R and r-lib/actions/setup-r-dependencies to install additional packages, which is a standard approach.

20-30: README check is a useful addition.

The step checks if README.md has changed and exits with an error if it has, ensuring documentation consistency.

tests/testthat/test-backwards-compatability.R (1)

1-25: Backward compatibility test is well-structured.

The test effectively ensures that old models are loaded correctly and predictions are made in the expected format, covering key aspects of backward compatibility.

DESCRIPTION (5)

24-24: Addition of earth to Suggests aligns with PR objectives.

The earth package is added to support features or testing related to earth models, as mentioned in the PR summary.

27-27: Addition of klaR to Suggests aligns with PR objectives.

The klaR package is added to support compatibility with klar models, as mentioned in the PR summary.

30-30: Addition of mgcv to Suggests likely supports improvements.

The mgcv package is added, possibly to support new features or testing requirements, aligning with the overall improvements.

33-33: Addition of pkgdown to Suggests enhances documentation capabilities.

The pkgdown package is added to support documentation generation, which is beneficial for package maintenance.

40-40: Removal of usethis from Suggests aligns with updated requirements.

The usethis package is removed, possibly reflecting a shift in development practices or reduced need.

inst/data-raw/test-all_models.R (3)

8-10: Use of suppressWarnings and suppressMessages is appropriate here.

The function very_quiet is designed to suppress warnings and messages, which is suitable for testing scenarios where output noise should be minimized. This is a good practice for ensuring that only relevant test failures are reported.

15-24: Data setup is clear and concise.

The data setup section uses a fixed seed for reproducibility, which is essential for consistent test results. The creation of the X matrix and y vector is straightforward and well-documented.

60-65: Model exclusion rationale is well-considered.

The exclusion of models that are too slow or cannot be installed is a practical approach to ensure the tests run efficiently. This helps focus on models that are feasible to test in a reasonable timeframe.
inst/data-raw/build_backwards_compatability_data.R (3)
24-51: Robust error and warning handling.

The test_error function captures both errors and warnings effectively. The use of tryCatch and withCallingHandlers is appropriate for isolating issues within the model. This approach is well-suited for debugging complex models.

108-115: Ensure saved model file path consistency.

The saveRDS function saves the pruned model to a specific path. Ensure that this path remains consistent with the project's directory structure and update it if necessary.

Verification successful

File path consistency verified.

The directory tests/testthat/data exists, and there are no conflicting references to this path elsewhere in the codebase. The path used in the saveRDS function is consistent with the project's directory structure.
Scripts executed

The following scripts were executed for the analysis:

Script:
#!/bin/bash
# Description: Verify the existence of the saved model file path.

# Test: Check if the directory path for saving the model exists.
if [ ! -d "tests/testthat/data" ]; then
  echo "Directory tests/testthat/data does not exist. Please create it or update the path."
  exit 1
fi
Length of output: 36

Script:
#!/bin/bash
# Search for references to the path "tests/testthat/data" in the codebase
rg "tests/testthat/data"
Length of output: 24
11-12: Ensure package version availability.

The script installs a specific version of LDLcalc. Ensure that this version remains available on CRAN or consider alternative methods to handle potential unavailability in the future.
Makefile (4)

24-24: Addition of check-many-preds target is beneficial.

Including the check-many-preds target in the help section provides clarity on its purpose and ensures users are aware of this new functionality.

139-142: Integration of check-many-preds target.

The check-many-preds target runs a script to validate predictions on numerous models. This integration is a valuable addition to ensure model reliability before release.

154-154: Ensure check-many-preds is executed during release.

By including check-many-preds in the release process, you ensure that model predictions are validated, enhancing the robustness of the release.

166-166: Improved cleanup with .Rout removal.

The addition of .Rout files to the clean target helps maintain a tidy workspace by removing temporary files generated during R script execution.

README.rmd (1)

42-42: Good practice: Set seed for reproducibility.

The addition of set.seed(42L) ensures that the random sampling of the diamonds dataset is reproducible, which is a good practice for examples and tests.
README.md (3)
72-82: Update in RMSE and weights for greedy ensemble.

The RMSE and model weights in the greedy ensemble have been updated. Verify these changes align with updates in the model or training data.

Verification successful

Verification Successful: RMSE and Weights Updates in README.md

The updates to the RMSE and weights for the greedy ensemble in the README.md align with the underlying code and tests. The greedyMSE function handles these calculations, and the test cases verify their accuracy. No discrepancies found.
Scripts executed

The following scripts were executed for the analysis:

Script:
#!/bin/bash
# Description: Verify consistency of RMSE and weights updates in greedy ensemble.

# Test: Search for changes in ensemble training or data affecting RMSE and weights.
rg --type r 'caretEnsemble'
Length of output: 10670

Script:
#!/bin/bash
# Search for RMSE calculations in the context of the greedy ensemble
rg 'RMSE' -A 5

# Search for weight assignments in the context of the greedy ensemble
rg 'Weights' -A 5
Length of output: 12637
52-53: Update in RMSE values for model accuracy.

The RMSE values for the random forest and glmnet models have been updated, indicating changes in model performance. Ensure these updates are consistent with any changes in the model or data.

Line range hint 101-115: Update in RMSE and variance explained for non-linear ensemble.

The RMSE and variance explained in the non-linear ensemble have been updated. Ensure these reflect changes in model configuration or data.

Verification successful

Verification Successful: RMSE and Variance Explained Updates Reflected

The README.md file correctly reflects the updated RMSE and variance explained for the non-linear ensemble using caretStack. The values are consistent with recent changes in the codebase.

RMSE: 1020.387

Variance Explained: 93.33%
Scripts executed

The following scripts were executed for the analysis:

Script:
#!/bin/bash
# Description: Verify consistency of RMSE and variance updates in non-linear ensemble.

# Test: Search for changes in non-linear ensemble affecting RMSE and variance.
rg --type r 'caretStack'
Length of output: 12143

Script:
#!/bin/bash
# Description: Search for recent changes in the codebase related to RMSE and variance explained.

# Look for recent commits that mention RMSE or variance.
git log -p -S 'RMSE' -- README.md
git log -p -S 'variance explained' -- README.md
Length of output: 7750
tests/testthat/test-caretList.R (1)

295-326: Enhanced test coverage for caretList with array/matrix models.

The new test case effectively validates caretList functionality with models that return arrays or matrices, such as "earth", "gam", and "stepLDA". This addition is well-structured and improves test coverage.

R/caretPredict.R (3)

31-32: Validate data frame output for probability predictions.

The stopifnot statement ensures that the probability predictions are returned as a data frame. This is a good practice to validate the output type.

34-38: Validate numeric vector output for raw predictions.

The stopifnot and conversion to a vector ensure that raw predictions are numeric vectors. This is important for backward compatibility with older models.

27-29: Ensure matrix conversion is necessary.

The conversion of newdata to a matrix for "neuralnet" and "klaR" libraries is a necessary adjustment. Ensure that this conversion is consistently required for these libraries across all scenarios.

zachmayer added 14 commits August 13, 2024 16:35

handle array/matrix case

4e43b94

usethis

2ebfc1a

nbrbrbr

26be295

just a regular test

b7560c3

ok the test script works

b5b2076

script works

33c0ce9

rebuild

3611a50

rebuild

c2b9c73

add test case

8f06507

shrink it

85bcad0

add failing test

1902928

fix failing test

d9114fd

add seed to readme lol

6384040

re add readme workflow, with a seed it should pass now

d73acd5

coderabbitai bot reviewed Aug 14, 2024

View reviewed changes

ok different platforms = different rf results.

0110c7b

zachmayer merged commit eaee3bb into main Aug 14, 2024
13 checks passed

zachmayer deleted the fix-rev-dep branch August 14, 2024 17:28

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[324] Fix rev dep #325

[324] Fix rev dep #325

zachmayer commented Aug 14, 2024

coderabbitai bot commented Aug 14, 2024 •

edited

Loading

Review skipped

Chat

CodeRabbit Commands (invoked as PR comments)

CodeRabbit Configuration File (`.coderabbit.yaml`)

Documentation and Community

coderabbitai bot left a comment

[324] Fix rev dep #325

[324] Fix rev dep #325

Conversation

zachmayer commented Aug 14, 2024

coderabbitai bot commented Aug 14, 2024 • edited Loading

Review skipped

Walkthrough

Changes

Sequence Diagram(s)

Chat

CodeRabbit Commands (invoked as PR comments)

CodeRabbit Configuration File (.coderabbit.yaml)

Documentation and Community

coderabbitai bot left a comment

Choose a reason for hiding this comment

coderabbitai bot commented Aug 14, 2024 •

edited

Loading

CodeRabbit Configuration File (`.coderabbit.yaml`)