docs/alloy-models: add new folder for Alloy models along w/ model for Linear Fee Function bug fix #8943

Roasbeef · 2024-07-27T02:12:11Z

Alloy Models

This PR adds a new Alloy Model, which is a method of "lightweight formal models"
written using the Alloy model checker and language.

Compared to typical formal methods, Alloy is a bounded model checker,
considered a lightweight method to formal analysis.
Lightweight formal methods are easier to use than fully
fledged formal methods as rather than attempting to prove the that a model is
always correct (for all instances), they instead operate on an input of a set
of bounded parameters and iterations. These models can then be used to specify
a model, then use the model checker to find counter examples of a given
assertion. If a counter example can't be found, then the model may be valid.

Alloy in particular is an expressive, human readable language for formal
modeling. It also has a nice visualizer which can show counter examples, aiding
in development and understanding.

Alloy is useful as when used during upfront software design (or even after the
fact), one can create a formal model of a software system to gain better
confidence re the correctness of a system. The model can then be translated
to code. Many times, writing the model (either before or after the code) can
help one to better understand a given software system. Models can also be used
to specify protocols in p2p systems such as Lightning (as it supports temporal
logic, which enables creation of state machines and other interactive transcripts),
serving as a complement to a written specification.

Linear Fee Function

In this PR, we add the first Alloy model, for the Linear Fee
Function fee bumping mechanism in lnd.

This model was inspired by a bug fix, due to an off-by-one error in the
original code: #8741.

The bug in the original code was fixed in this PR:
#8751.

Model & Bug Fix Walk-through

The model includes an assertion that captures the essence of the bug:
max_fee_rate_before_deadline:

// max_fee_rate_before_deadline is the main assertion in this model. This
// captures a model violation for our fee function, but only if the line in
// fee_rate_at_position is uncommented.
//
// In this assertion, we declare that if we have a fee function that has a conf
// target of 4 (we want a few fee bumps), and we bump to the final block, then
// at that point our current fee rate is the ending fee rate. In the original
// code, assertion isn't upheld, due to an off by one error.
assert max_fee_rate_before_deadline {
  always req_num_blocks_to_conf[4] => bump_to_final_block => eventually (
    all f: LinearFeeFunction | f.position = f.width.sub[1] &&
                               f.currentFeeRate = f.endingFeeRate
  )
}

We can modify the model to find the bug described in the original issue.

First, we modify the model by forcing a check on the
max_fee_rate_before_deadline assertion:
alloy check max_fee_rate_before_deadline
Next, we'll modify the fee_rate_at_position predicate to re-introduce
the off by one error:
alloy p >= f.width => f.endingFeeRate // -- NOTE: Uncomment this to re-introduce the original bug.

If we hit Execute in the Alloy Analyzer, then we get a counter example:

We can hit Show in the analyzer to visualize it:

We can see that even though we're one block (position) before the deadline
(width), our fee rate isn't at the ending fee rate yet.

If we modify the fee_rate_at_position to have the correct logic:

p >= f.width.sub[1] => f.endingFeeRate

Then Alloy is unable to find any counterexamples:

coderabbitai · 2024-07-27T02:12:16Z

Important

Review skipped

Auto reviews are limited to specific labels.

Labels to auto review (1)

llm-review

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

Thank you for using CodeRabbit. We offer it for free to the OSS community and would appreciate your support in helping us grow. If you find it useful, would you consider giving us a shout-out on your favorite social media?

Share

Tips

Chat

There are 3 ways to chat with CodeRabbit:

Review comments: Directly reply to a review comment made by CodeRabbit. Example:
- I pushed a fix in commit <commit_id>.
- Generate unit testing code for this file.
- Open a follow-up GitHub issue for this discussion.
Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query. Examples:
- @coderabbitai generate unit testing code for this file.
- @coderabbitai modularize this function.
PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
- @coderabbitai generate interesting stats about this repository and render them as a table.
- @coderabbitai show all the console.log statements in this repository.
- @coderabbitai read src/utils.ts and generate unit testing code.
- @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.
- @coderabbitai help me debug CodeRabbit configuration file.

Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments.

CodeRabbit Commands (invoked as PR comments)

@coderabbitai pause to pause the reviews on a PR.
@coderabbitai resume to resume the paused reviews.
@coderabbitai review to trigger an incremental review. This is useful when automatic reviews are disabled for the repository.
@coderabbitai full review to do a full review from scratch and review all the files again.
@coderabbitai summary to regenerate the summary of the PR.
@coderabbitai resolve resolve all the CodeRabbit review comments.
@coderabbitai configuration to show the current CodeRabbit configuration for the repository.
@coderabbitai help to get help.

Additionally, you can add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.

CodeRabbit Configuration File (`.coderabbit.yaml`)

You can programmatically configure CodeRabbit by adding a .coderabbit.yaml file to the root of your repository.
Please see the configuration documentation for more information.
If your editor has YAML language server enabled, you can add the path at the top of this file to enable auto-completion and validation: # yaml-language-server: $schema=https://coderabbit.ai/integrations/schema.v2.json

Documentation and Community

Visit our Documentation for detailed information on how to use CodeRabbit.
Join our Discord Community to get help, request features, and share feedback.
Follow us on X/Twitter for updates and announcements.

morehouse · 2024-07-29T17:08:26Z

How does this improve over fuzzing?

At least for the fee function bug it seems a fuzz test would be simpler and more readable.

Roasbeef · 2024-07-29T19:32:11Z

How does this improve over fuzzing?

I don't think one can necessarily compare them like that. This is a simplified formal specification of the fee bumping algorithm, it exists independent of the code implementing the algorithm, but can be used to gain more confidence in the algorithm by leveraging the Alloy model checker. Fuzzing can help us find bugs in our code, formal modeling can help us find defects in our design.

At least for the fee function bug it seems a fuzz test would be simpler and more readable.

Consider if this was written before/during the original implementation, the model checker would've found a counter example before the code was even written. Fuzzing operates on an entirely different level of abstraction, you can't fuzz the design of a system. Formal methods are yet another tool in our tool box which includes: fuzzing, property based tests, mutation testing, etc, etc.

I recommend checking out these blog posts to help answer the "why" behind my motivations:

Using Formal Methods at Work
- With this PR, we're at the "Showcasing a bug" step.
Why Don't People Use Formal Methods
Formally Modeling Database Migrations
- A more hands on post that shows how one can use Alloy to specify and analyze a database migration before the code is even written.

Zooming out: we have some components in the LN spec that after all this time, are still somewhat poorly understood, lack comprehensive test vectors, and/or have insufficient specifications. Tools like Alloy can allow us to elevate the spec further, capturing the desired semantics in a formal model, with the ability to rule out certain invariants, and also serve as another guide for implementers.

One such example is the channel state machine, which is pretty much the cornerstone of LN interactions. We can use tools such as Alloy to better specify a model of the channel state machine, which can increase our understanding as a hole, and potentially find new gaps in the channel state machine beyond what's commonly known by implementors. Alloy supports temporal logic, so we can model both expected execution, and also error cases. Consider the following fragment that could be used to verify invariants around sending empty sigs in the protocol:

assert empty_sig_leads_to_error {
    always no pending_changes
    eventually (some commit_dance)
    eventually (some empty_sig_errors)
}

From here I plan to write a trivial model for the way ping/ping works in the protocol to get a better feel of how to best model message passing. Then I plan to write a model for the stfu protocol, as a more advanced warm up, before eventually tackling the LN state machine. I think a model of splicing would also be very useful, development uncovered a number of edge cases which right now are demonstrated in the form of somewhat elaborate acsii diagrams. We can capture all this with a formal model, to better advance our understanding, and potentially even find more subtle bugs/interactions.

As far as readability goes, IMO Alloy is the most readable of any of the formal specification languages I've come across. Compare the syntax in this PR to TLA+:

VARIABLES counter, pc

vars == << counter, pc >>

ProcSet == (Threads)

Init == (* Global variables *)
        /\ counter = 0
        /\ pc = [self \in ProcSet |-> "IncCounter"]

IncCounter(self) == /\ pc[self] = "IncCounter"
                    /\ counter' = counter + 1
                    /\ pc' = [pc EXCEPT ![self] = "Done"]

thread(self) == IncCounter(self)

(* Allow infinite stuttering to prevent deadlock on termination. *)
Terminating == /\ \A self \in ProcSet: pc[self] = "Done"
               /\ UNCHANGED vars

Next == (\E self \in Threads: thread(self))
           \/ Terminating

Spec == Init /\ [][Next]_vars

It may look unfamiliar to you, but that's because you don't yet have exposure to it, just as any new langauge.

ProofOfKeags

Super exciting to have one of these to look at! I read all of it, and have some questions.

docs/alloy-models/README.md

docs/alloy-models/linear-fee-function/linear-fee.als

ProofOfKeags · 2024-07-31T01:02:36Z

docs/alloy-models/linear-fee-function/linear-fee.als

+// bump_to_completion is a predicate that can be used to force fee_bump events
+// to be emitted until we're right at our confirmation deadline.
+pred bump_to_completion {
+  always (all f: LinearFeeFunction | f.position < f.width => eventually (some fee_bump))


Do you need the eventually here? can't you just use set here to capture the no bump case. Or are you trying to say that we must eventually get at least one fee_bump every time position < width?

If I remove the eventually then even just the run example at the very end fails. However, It looks non-essential for bump_to_final_block, as that doesn't have a noop case of no fee bump as you've noted.

Would need to experiment more, but I think removing it for bump_to_completion fails as if you have an init, then stutter f.position < f.width is true, but we can have another stutter right after that, so in the exact time step that it's true, we don't yet have a fee_bump.

docs/alloy-models/linear-fee-function/linear-fee.als

Roasbeef · 2024-08-01T01:41:51Z

Posted revision 3: https://reviewable.io/reviews/lightningnetwork/lnd/8943

yyforyongyu

Left a few comments, still reading through the tutorials🤓 I think in addition to being used to analyze the specs, it should also be helpful for all the subsystems used in lnd?

yyforyongyu · 2024-08-01T14:22:08Z

docs/alloy-models/README.md

@@ -0,0 +1,50 @@
+# Alloy Models


docs/alloy-models/linear-fee-function/linear-fee.als

Roasbeef · 2024-08-02T00:31:43Z

I think in addition to being used to analyze the specs, it should also be helpful for all the subsystems used in lnd?

Yep! Depending at which point in the design/implementation/testing/review cycle we insert things in, it can help to refine thinking of a new system, or even help model concurrent interactions with other systems. The blog I linked above even has an example where they use it to model a DB migration, and find some issues with the design in general. This post also has good suggestion on how to gradually integrate into your workflow: https://www.hillelwayne.com/post/using-formal-methods/

Part of the learning curve is figuring out which level of abstraction is sufficient for the model. Eg: if we were trying to model the channel state machine, we could add things like: payment hashes, commit txn construction, signs, etc. Or, we can have the model operate at a higher level, and use a more abstract notion of a inbox/queue for new changes and acked/unacked set, etc.

One other example that pops to mind would be some of the repeated bug fixes and refactoring we've done w.r.t the router's payment/HTLC dispatch. So how to handle timeouts, updating the main state, etc, etc. Nice part is that it can also idrectly help to generate diagrams of everything via the visualizer. The visualizer also has a lot of settings to make what it generates more useful w/ colors, diff shapes, focusing on certain aspect, etc. etc

Roasbeef · 2024-08-02T00:40:25Z

still reading through the tutorials🤓

If you haven't already, it'll also be useful to download the Analyzer and run the model in it. Throughput the process I either had Vim open and then used the Analyzer to run stuff (using the reload), or just ran things directly in the Analyzer. It also has a REPL like interface that can be used to debug stuff or just see how it would evaluate certain expressions/statements.

In this commit, we add a model for the linear fee function we use in lnd for fee bumping. This models can be used to reproduce the issue reported in lightningnetwork#8741, and can also be shown that that bug fix resolves a counter example found by the model checker.

Roasbeef · 2024-08-02T00:43:11Z

Also right now CI doesn't do much, but in the future we can have it run the Java binary for Alloy on the models checked in to make sure they don't have obvious counter examples and have valid syntax, etc.

lightninglabs-deploy · 2024-08-09T01:03:57Z

@ProofOfKeags: review reminder
@Roasbeef, remember to re-request review from reviewers when ready

yyforyongyu

LGTM🦾 Still walking through the suggested tutorials atm, maybe we could make this a recommended approach when designing future subsystems.

Roasbeef added testing Improvements/modifications to the test suite fees Related to the fees paid for transactions (both LN and funding/commitment transactions) alloy formal methods labels Jul 27, 2024

Roasbeef added no-itest no-changelog labels Jul 27, 2024

Roasbeef requested review from ProofOfKeags and yyforyongyu July 30, 2024 03:43

ProofOfKeags requested changes Jul 31, 2024

View reviewed changes

Roasbeef force-pushed the alloy-linear-fee-model branch from 3a49d3e to 50053c8 Compare August 1, 2024 01:40

Roasbeef requested a review from ProofOfKeags August 1, 2024 01:40

yyforyongyu reviewed Aug 1, 2024

View reviewed changes

Roasbeef added 2 commits August 1, 2024 17:42

docs/alloy-models: add initial README.md for Alloy models

7c61dc6

Roasbeef force-pushed the alloy-linear-fee-model branch from 50053c8 to 9aa7e75 Compare August 2, 2024 00:43

Roasbeef added the size/kilo medium, proper context needed, less than 1000 lines label Aug 2, 2024

Roasbeef requested a review from yyforyongyu August 9, 2024 01:42

yyforyongyu approved these changes Aug 12, 2024

View reviewed changes

Roasbeef merged commit c0420fe into lightningnetwork:master Aug 13, 2024
19 of 21 checks passed

Roasbeef mentioned this pull request Aug 14, 2024

Lightning Dev Summit Topics lightning/bolts#1171

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs/alloy-models: add new folder for Alloy models along w/ model for Linear Fee Function bug fix #8943

docs/alloy-models: add new folder for Alloy models along w/ model for Linear Fee Function bug fix #8943

Roasbeef commented Jul 27, 2024 •

edited

Loading

coderabbitai bot commented Jul 27, 2024

Review skipped

Chat

CodeRabbit Commands (invoked as PR comments)

CodeRabbit Configuration File (`.coderabbit.yaml`)

Documentation and Community

morehouse commented Jul 29, 2024

Roasbeef commented Jul 29, 2024 •

edited

Loading

ProofOfKeags left a comment

ProofOfKeags Jul 31, 2024

Roasbeef Aug 1, 2024

Roasbeef commented Aug 1, 2024

yyforyongyu left a comment

yyforyongyu Aug 1, 2024

Roasbeef commented Aug 2, 2024 •

edited

Loading

Roasbeef commented Aug 2, 2024

Roasbeef commented Aug 2, 2024

lightninglabs-deploy commented Aug 9, 2024

yyforyongyu left a comment

docs/alloy-models: add new folder for Alloy models along w/ model for Linear Fee Function bug fix #8943

docs/alloy-models: add new folder for Alloy models along w/ model for Linear Fee Function bug fix #8943

Conversation

Roasbeef commented Jul 27, 2024 • edited Loading

Alloy Models

Linear Fee Function

Model & Bug Fix Walk-through

coderabbitai bot commented Jul 27, 2024

Review skipped

Chat

CodeRabbit Commands (invoked as PR comments)

CodeRabbit Configuration File (.coderabbit.yaml)

Documentation and Community

morehouse commented Jul 29, 2024

Roasbeef commented Jul 29, 2024 • edited Loading

ProofOfKeags left a comment

Choose a reason for hiding this comment

ProofOfKeags Jul 31, 2024

Choose a reason for hiding this comment

Roasbeef Aug 1, 2024

Choose a reason for hiding this comment

Roasbeef commented Aug 1, 2024

yyforyongyu left a comment

Choose a reason for hiding this comment

yyforyongyu Aug 1, 2024

Choose a reason for hiding this comment

Roasbeef commented Aug 2, 2024 • edited Loading

Roasbeef commented Aug 2, 2024

Roasbeef commented Aug 2, 2024

lightninglabs-deploy commented Aug 9, 2024

yyforyongyu left a comment

Choose a reason for hiding this comment

Roasbeef commented Jul 27, 2024 •

edited

Loading

CodeRabbit Configuration File (`.coderabbit.yaml`)

Roasbeef commented Jul 29, 2024 •

edited

Loading

Roasbeef commented Aug 2, 2024 •

edited

Loading