Better Encapsulate Scoring and Cache Point Count #3188
Conversation
Codecov Report
Attention: Patch coverage is …

Additional details and impacted files

@@            Coverage Diff             @@
##              main     #3188      +/-   ##
============================================
+ Coverage    89.68%    89.70%    +0.01%
============================================
  Files          124       125        +1
  Lines       102386    102458       +72
  Branches    102386    102458       +72
============================================
+ Hits         91827     91912       +85
+ Misses        7857      7843       -14
- Partials      2702      2703        +1

☔ View full report in Codecov by Sentry.
Force-pushed from 524caf3 to 74bee94 (Compare)
@@ -0,0 +1,499 @@
//! Approximation of log_10 using a lookup table.
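For context on what the new file's doc comment is describing, here is a minimal sketch of a lookup-table log10 approximation. The idea is simply that log10(x) = log2(x) * log10(2): the leading-bit position gives the integer part of log2(x), and a small table of log2(1 + i/16) refines the fractional part. The actual file added in this PR is roughly 500 lines and far more precise; the constants, names, and table size below are illustrative assumptions, not the PR's code.

```rust
/// log10(2), used to convert a base-2 log into a base-10 log.
const LOG10_2: f64 = 0.301029995663981195;

/// Precomputed log2(1 + i/16) for i in 0..16, refining the fractional part.
const FRAC_LOG2_TABLE: [f64; 16] = [
	0.000000, 0.087463, 0.169925, 0.247928, 0.321928, 0.392317, 0.459432, 0.523562,
	0.584963, 0.643856, 0.700440, 0.754888, 0.807355, 0.857981, 0.906891, 0.954196,
];

/// Approximates log10(x) for a u64 (returns 0.0 for x == 0 rather than -inf).
fn approx_log10(x: u64) -> f64 {
	if x == 0 { return 0.0; }
	// The position of the highest set bit gives the integer part of log2(x)...
	let int_log2 = 63 - x.leading_zeros();
	// ...and the four bits just below it index the fractional-part table.
	let frac_idx = if int_log2 >= 4 {
		((x >> (int_log2 - 4)) & 0x0f) as usize
	} else {
		((x << (4 - int_log2)) & 0x0f) as usize
	};
	(int_log2 as f64 + FRAC_LOG2_TABLE[frac_idx]) * LOG10_2
}

fn main() {
	// log10(1000) is 3; this coarse approximation lands within about 0.4%.
	assert!((approx_log10(1000) - 3.0).abs() < 0.02);
}
```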
do you feel like log approximations might wanna go in util, or will we only ever use them for scoring?
Not sure why we'd ever use them elsewhere, and, really, the `util` module is an annoying grab-bag of random stuff that should be avoided where possible IMO.
Basically LGTM
Force-pushed from 9216b85 to 7798c81 (Compare)
LMK if I should squash.
Yup, feel free to squash.
Force-pushed from 7798c81 to 6680b97 (Compare)
Squashed without further changes.
Given that this is a new file that was just introduced in the previous commit, is this even worth a separate commit?
Also note that the implication is that the prior commit would fail CI in isolation.
GitHub didn't show what this is in reference to, but I assume it's the `rustfmt` commit. Yes, it's absolutely worth it, because with it you can verify the move-only change by running `git show --color-moved`, without having to check anything.
@@ -1663,6 +1654,29 @@ mod bucketed_history {
	}
}

pub(super) fn has_datapoints(&self) -> bool {
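As a rough illustration of the accessor shown in this hunk (a simplified sketch only; the real tracker's bucket layout, visibility, and surrounding module differ):

```rust
mod bucketed_history {
	/// Simplified stand-in for the bucket tracker: the buckets are private to
	/// this module, so outside code can only query them through accessors.
	pub(super) struct HistoricalBucketRangeTracker {
		buckets: [u16; 32],
	}

	impl HistoricalBucketRangeTracker {
		pub(super) fn new() -> Self { Self { buckets: [0; 32] } }

		/// True if any bucket is non-zero, i.e. there is at least one datapoint
		/// and historical scoring is worth attempting at all.
		pub(super) fn has_datapoints(&self) -> bool {
			self.buckets.iter().any(|b| *b != 0)
		}
	}
}

fn main() {
	let tracker = bucketed_history::HistoricalBucketRangeTracker::new();
	assert!(!tracker.has_datapoints());
}
```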
Couldn't the introduction of these methods be split into a separate commit from the bucket isolation refactor component?
I mean, sure, but that commit is only +85, it's not exactly huge. Would you like me to change it?
nah, it's fine
Force-pushed from 6680b97 to a5760c8 (Compare)
Rewrote the last commit message but didn't change the diff at all.
Rebased
Force-pushed from a5760c8 to 342e955 (Compare)
In the coming commits we'll isolate historical bucket logic slightly further, allowing us to cache some state. This is the first step towards that, storing the historical liquidity information in a new `HistoricalLiquidityTracker` rather than in the general `ChannelLiquidity`.
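Schematically, the move looks something like the following sketch; the field names and shapes are assumptions for illustration, and the real structs carry more fields.

```rust
use std::time::Duration;

/// Illustrative stand-in for the per-direction bucket storage.
#[derive(Default)]
struct HistoricalBucketRangeTracker {
	buckets: [u16; 32],
}

/// After this commit, the historical liquidity data lives in its own tracker...
#[derive(Default)]
struct HistoricalLiquidityTracker {
	min_liquidity_offset_history: HistoricalBucketRangeTracker,
	max_liquidity_offset_history: HistoricalBucketRangeTracker,
}

/// ...and `ChannelLiquidity` holds the tracker rather than the raw buckets, so
/// bucket invariants (and, later, caches) are managed in one place.
#[derive(Default)]
struct ChannelLiquidity {
	min_liquidity_offset_msat: u64,
	max_liquidity_offset_msat: u64,
	last_updated: Duration,
	liquidity_history: HistoricalLiquidityTracker,
}

fn main() {
	let liquidity = ChannelLiquidity::default();
	// The history buckets are now only reachable through the tracker.
	let _history = liquidity.liquidity_history;
}
```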
In a coming commit we'll cache some additional data in the historical bucket tracker. In order to do so, here we isolate the buckets themselves into the `bucketed_history` module, reducing the possibility of accidentally updating them directly without updating caches.
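The encapsulation being described can be pictured roughly as follows; this is illustrative only, and both the module's real API and the actual cached value differ (here a simple counter stands in for the cache).

```rust
mod bucketed_history {
	/// The buckets and any caches derived from them are private to this module.
	pub struct HistoricalBucketRangeTracker {
		buckets: [u16; 32],
		total_points: u64, // placeholder for a cache a later commit maintains
	}

	impl HistoricalBucketRangeTracker {
		pub fn new() -> Self { Self { buckets: [0; 32], total_points: 0 } }

		/// The only way for code outside this module to modify a bucket, which
		/// guarantees the cached value can never silently go stale.
		pub fn track_datapoint(&mut self, bucket_idx: usize) {
			self.buckets[bucket_idx] = self.buckets[bucket_idx].saturating_add(1);
			self.total_points += 1;
		}

		pub fn total_points(&self) -> u64 { self.total_points }
	}
}

fn main() {
	let mut tracker = bucketed_history::HistoricalBucketRangeTracker::new();
	tracker.track_datapoint(7);
	assert_eq!(tracker.total_points(), 1);
}
```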
Rather than storing the two directions' buckets in `HistoricalMinMaxBuckets` (renamed `DirectedHistoricalLiquidityTracker`), we store a single reference to the `HistoricalLiquidityTracker` as well as the direction bool. This will allow us in the next commit to reference fields in the `HistoricalLiquidityTracker` aside from the two directions.
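In shape, that change is roughly the following; the field and method names are assumptions for illustration, not the PR's exact code.

```rust
/// Illustrative stand-in for the undirected tracker from the previous sketch.
struct HistoricalLiquidityTracker {
	min_liquidity_offset_history: [u16; 32],
	max_liquidity_offset_history: [u16; 32],
}

/// The directed view (formerly `HistoricalMinMaxBuckets`) now borrows the whole
/// tracker and remembers which direction it represents, so later commits can
/// also read tracker-level fields such as a cached point count.
struct DirectedHistoricalLiquidityTracker<'a> {
	tracker: &'a HistoricalLiquidityTracker,
	source_less_than_target: bool,
}

impl<'a> DirectedHistoricalLiquidityTracker<'a> {
	/// Picks the min-liquidity buckets for this direction.
	fn min_liquidity_offset_history(&self) -> &[u16; 32] {
		if self.source_less_than_target {
			&self.tracker.min_liquidity_offset_history
		} else {
			&self.tracker.max_liquidity_offset_history
		}
	}
}

fn main() {
	let tracker = HistoricalLiquidityTracker {
		min_liquidity_offset_history: [0; 32],
		max_liquidity_offset_history: [0; 32],
	};
	let directed = DirectedHistoricalLiquidityTracker {
		tracker: &tracker,
		source_less_than_target: true,
	};
	assert!(directed.min_liquidity_offset_history().iter().all(|b| *b == 0));
}
```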
When we go to score a channel using the historical liquidity data, the first thing we do is step through all the valid bucket combinations, multiply the min and max bucket, and then add them together to calculate the total number of points tracked. This isn't a free operation, and for scorers without much data it represents a large part of the total time spent scoring during routefinding. Thus, here we cache this value, updating it every time the buckets are updated.
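Concretely, the quantity being cached is something like the following; this is a sketch, and the real code lives in the `bucketed_history` module with different names (e.g. `recalculate_valid_point_count` and `total_valid_points_tracked` here are illustrative).

```rust
struct HistoricalLiquidityTracker {
	min_liquidity_offset_history: [u16; 32],
	max_liquidity_offset_history: [u16; 32],
	/// Cached sum over all valid (min, max) bucket pairs of min * max. Reading
	/// this one value is enough to decide whether there is enough data to bother
	/// with historical scoring, and it is the denominator when there is.
	total_valid_points_tracked: u64,
}

impl HistoricalLiquidityTracker {
	/// Called from every method that mutates the buckets, so the cache can never
	/// be observed out of sync with them.
	fn recalculate_valid_point_count(&mut self) {
		let mut total = 0u64;
		for (min_idx, min_bucket) in self.min_liquidity_offset_history.iter().enumerate() {
			// Only combinations where the min and max buckets don't cross are
			// valid, hence the shrinking `take` as the min bucket index grows.
			for max_bucket in self.max_liquidity_offset_history.iter().take(32 - min_idx) {
				total += (*min_bucket as u64) * (*max_bucket as u64);
			}
		}
		self.total_valid_points_tracked = total;
	}
}

fn main() {
	let mut tracker = HistoricalLiquidityTracker {
		min_liquidity_offset_history: [1; 32],
		max_liquidity_offset_history: [1; 32],
		total_valid_points_tracked: 0,
	};
	tracker.recalculate_valid_point_count();
	// 32 + 31 + ... + 1 = 528 valid (min, max) pairs, each contributing 1 * 1.
	assert_eq!(tracker.total_valid_points_tracked, 528);
}
```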
During routing, the majority of our time is spent in the scorer. Given the scorer isn't actually doing all that much computation, this means we're quite sensitive to memory latency. Thus, the cache lines our data sits on are incredibly important. Here, we manually lay out the `ChannelLiquidity` and `HistoricalLiquidityTracker` structs to ensure that we can do the non-historical scoring, and skip historical scoring for channels with insufficient data, by just looking at the same cache line the channel's SCID is on. Sadly, to do the full historical scoring we need to load a second 128-byte cache line pair, but we have some time to get there. We might consider issuing a preload instruction in the future. This improves performance by a few percent.
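To make the layout idea concrete, here is a rough hot/cold sketch with assumed field names and sizes, not the PR's actual layout. Note that pinning a declared field order in Rust generally requires `repr(C)`, since the default representation is free to reorder fields; whether this PR relies on `repr(C)` or another mechanism is not shown in this excerpt.

```rust
use std::time::Duration;

#[repr(C)]
struct HistoricalLiquidityTracker {
	/// Hot: one u64 read decides whether historical scoring is worth doing.
	total_valid_points_tracked: u64,
	/// Cold: 64 bytes per direction, only touched when actually scoring.
	min_liquidity_offset_history: [u16; 32],
	max_liquidity_offset_history: [u16; 32],
}

#[repr(C)]
struct ChannelLiquidity {
	// Hot fields first: everything non-historical scoring needs should sit near
	// the data the router has already pulled in for this channel.
	min_liquidity_offset_msat: u64,
	max_liquidity_offset_msat: u64,
	last_updated: Duration,
	offset_history_last_updated: Duration,
	// The tracker follows, leading with its own hot field.
	liquidity_history: HistoricalLiquidityTracker,
}

// A compile-time guard on the tracker's size (8 + 64 + 64 = 136 bytes here).
const _: () = assert!(core::mem::size_of::<HistoricalLiquidityTracker>() == 136);

fn main() {
	println!("ChannelLiquidity: {} bytes", core::mem::size_of::<ChannelLiquidity>());
}
```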
Force-pushed from 342e955 to 4fd07ec (Compare)
When we go to score a channel using the historical liquidity data, the first thing we do is step through all the valid bucket combinations, multiply the min and max bucket, and then add them together to calculate the total number of points tracked. This isn't a free operation, and for scorers without much data it represents a large part of the total time spent scoring during routefinding.

Thus, here we cache this value, updating it every time the buckets are updated. In order to do so, we first have to clean up `scorer.rs`, which has been needed for some time anyway, improving encapsulation so that caching is more reasonable.