Unsound command line options should be documented and warnings given when used. #6397

jimgrundy · 2021-10-15T13:26:49Z

The --reachability-slice option (and other slices) produce results that are unsound.

This request has two parts:
1/ All command line options that can produce unsound results should be clearly marked as such in the command's help.
2/ Any command run with an unsound command line option should produce in the output a warning indicating that the results are unsound and which option(s) are the reason for the potential unsoundness.

martin-cs · 2021-10-19T13:38:10Z

@jimgrundy I very much agree with the principle but none of the slices should produce unsound results, esp. --reachability-slice. If you have the time to file a bug report for what is going wrong, that would be great.

martin-cs · 2021-10-19T13:41:07Z

Ah, is #6394 what prompted this?

jimgrundy · 2021-10-19T18:04:56Z

Not directly, but yes via internal chats that led up to #6394. @kroening said that the slicers were not sound and should not be used. My feeling is that if there are command line options that we know might produce unsound results then that needs to be documented and warned. You might have to follow up with @kroening for examples of unsound slicers beyond those reported in #6349 and #260.

martin-cs · 2021-10-19T21:58:48Z

Perhaps it is useful to draw a distinction between:

A. Functionality that is (intentionally or unintentionally) unsound by design / choice of algorithm,
and
B. Functionality that is in-principle sound but has known or suspected issues of correctness.

A. we should definitely warn for, B. we should work towards eliminating.

As far as I know, issues with the slicers are of type B. They are just waiting on someone having time to fix them.

martin-cs · 2021-10-19T21:59:51Z

In the case of #260 I would be tempted to try re-running the benchmarks because I know there have been bug fixes to that code in the ... 5 years since it was first reported.

jimgrundy · 2021-10-19T22:16:57Z

I agree with @martin-cs on A vs B in principle, but if the time in which issues in class B lie around is measured in years rather than weeks then we risk having folks using these features and thinking they have proofs of production code when they do not. One suggestion would be to rename "--feature-X" to "--experimental-feature-X" until soundness queries are resolved. That would at least convey that feature-X is supposed to be sound, but I wouldn't bank on it just yet.

martin-cs · 2021-10-20T08:30:10Z

It's not unreasonable that "this should work but ... last I checked it didn't" should be somewhere other than in the heads of a few developers and implied by the bug tracker.

Renaming options is always awkward. What I wonder is whether we need a "--sound-options-only" or "--no-experimental" flag that will conflict if given with anything not on a white-list of options. Test coverage and no option bugs seem like a good criteria for adding things to the white-list.

I guess underlying this is one of Daniel's long-standing design goals which was to minimise the number of flags, esp. those that set magic constants. Over the last few years we haven't done as well at this as we should do.

jimgrundy · 2021-10-20T14:17:05Z

I like this last idea, but I would like to flip the sense. The tool should be presumed sound by default, so rather than having a "--forbid-stuff-we-dont-think-is-sound" option, those should be forbidden by default and the options should be "--experimental" (i.e. --allow-stuff-we-dont-think-is-sound").

martin-cs · 2021-10-20T14:38:06Z

Yes; you are right that's a better solution. --I-accept-this-might-give-wrong-answers?

jimgrundy · 2021-11-11T19:52:49Z

It looks like our desire is this:

Add a --unsound flag, that allows intentionally unsound options
Add a --experimental flag, that allows options that we want to be sound, but aren't confident in
Using any option that is intentionally unsound should get you a warning unless you used the --unsound flag.
Using any option that should be sound but we aren't confident in should get you a warning unless you used the --experimental flag
In a future major release those warnings should become errors

jimgrundy · 2021-12-16T21:02:33Z

Happy to close this and pick up the discussion over on #6480

jimgrundy added aws Bugs or features of importance to AWS CBMC users aws-high labels Oct 15, 2021

danielsn added the soundness Soundness bug? Review and add "aws" if it is, or remove "soundness" if it isn't. label Oct 15, 2021

NlightNFotis mentioned this issue Nov 25, 2021

RFC: How to handle unsound flags #6480

Closed

jimgrundy closed this as completed Dec 16, 2021

TGWDB mentioned this issue May 31, 2023

[RFC] Version 6 #7743

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Unsound command line options should be documented and warnings given when used. #6397

Unsound command line options should be documented and warnings given when used. #6397

jimgrundy commented Oct 15, 2021

martin-cs commented Oct 19, 2021

martin-cs commented Oct 19, 2021

jimgrundy commented Oct 19, 2021

martin-cs commented Oct 19, 2021

martin-cs commented Oct 19, 2021

jimgrundy commented Oct 19, 2021 •

edited

Loading

martin-cs commented Oct 20, 2021

jimgrundy commented Oct 20, 2021

martin-cs commented Oct 20, 2021

jimgrundy commented Nov 11, 2021

jimgrundy commented Dec 16, 2021

Unsound command line options should be documented and warnings given when used. #6397

Unsound command line options should be documented and warnings given when used. #6397

Comments

jimgrundy commented Oct 15, 2021

martin-cs commented Oct 19, 2021

martin-cs commented Oct 19, 2021

jimgrundy commented Oct 19, 2021

martin-cs commented Oct 19, 2021

martin-cs commented Oct 19, 2021

jimgrundy commented Oct 19, 2021 • edited Loading

martin-cs commented Oct 20, 2021

jimgrundy commented Oct 20, 2021

martin-cs commented Oct 20, 2021

jimgrundy commented Nov 11, 2021

jimgrundy commented Dec 16, 2021

jimgrundy commented Oct 19, 2021 •

edited

Loading