Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: Reduce stack size after optimizations #6698

Merged
merged 2 commits into from
May 27, 2024
Merged

Conversation

sirasistant
Copy link
Collaborator

@sirasistant sirasistant commented May 27, 2024

After this PR we should be able to set the stack size back down to the previous value.

@sirasistant sirasistant changed the title perf: Reduce stack size after optimizations feat: Reduce stack size after optimizations May 27, 2024
@AztecBot
Copy link
Collaborator

Benchmark results

Metrics with a significant change:

  • protocol_circuit_witness_generation_time_in_ms (private-kernel-reset-small): 1,989 (-16%)
Detailed results

All benchmarks are run on txs on the Benchmarking contract on the repository. Each tx consists of a batch call to create_note and increment_balance, which guarantees that each tx has a private call, a nested private call, a public call, and a nested public call, as well as an emitted private note, an unencrypted log, and public storage read and write.

This benchmark source data is available in JSON format on S3 here.

Proof generation

Each column represents the number of threads used in proof generation.

Metric 1 threads 4 threads 16 threads 32 threads 64 threads
proof_construction_time_sha256 5,712 (-1%) 1,544 (-1%) 711 (-1%) 756 (-4%) 773 (-1%)

L2 block published to L1

Each column represents the number of txs on an L2 block published to L1.

Metric 8 txs 32 txs 64 txs
l1_rollup_calldata_size_in_bytes 1,412 1,412 1,412
l1_rollup_calldata_gas 9,464 9,476 9,476
l1_rollup_execution_gas 616,105 616,117 616,117
l2_block_processing_time_in_ms 1,281 4,777 (+1%) 9,489
l2_block_building_time_in_ms 44,910 (+2%) 177,767 (+1%) 355,004 (+2%)
l2_block_rollup_simulation_time_in_ms 44,745 (+2%) 177,143 (+1%) 353,788 (+2%)
l2_block_public_tx_process_time_in_ms 24,166 (+5%) 101,937 (+3%) 205,722 (+2%)

L2 chain processing

Each column represents the number of blocks on the L2 chain where each block has 16 txs.

Metric 3 blocks 5 blocks
node_history_sync_time_in_ms 9,581 (+3%) 14,527 (+2%)
node_database_size_in_bytes 14,487,632 21,356,624
pxe_database_size_in_bytes 18,071 29,868

Circuits stats

Stats on running time and I/O sizes collected for every kernel circuit run across all benchmarks.

Circuit simulation_time_in_ms witness_generation_time_in_ms proving_time_in_ms input_size_in_bytes output_size_in_bytes proof_size_in_bytes num_public_inputs size_in_gates
private-kernel-init 160 (+1%) 3,560 (+3%) 26,732 (+5%) 20,630 64,614 89,536 2,731 1,048,576
private-kernel-inner 620 (+1%) 4,188 (+4%) 51,347 92,318 64,614 89,536 2,731 2,097,152
private-kernel-tail 566 (+1%) 2,818 (+3%) 41,888 (-5%) 96,541 77,498 10,656 266 2,097,152
base-parity 6.45 (+2%) 1,205 (+9%) 2,794 128 64.0 2,208 2.00 131,072
root-parity 49.3 (+1%) 68.7 (+6%) 50,344 (-1%) 27,084 64.0 2,720 18.0 2,097,152
base-rollup 714 (-6%) 2,330 (+4%) 87,438 119,610 756 3,648 47.0 4,194,304
root-rollup 110 (+2%) 75.8 (-1%) 23,898 (+2%) 25,297 620 3,456 41.0 1,048,576
public-kernel-app-logic 522 (+1%) 3,014 (+5%) 51,475 (+1%) 104,941 86,302 114,784 3,520 2,097,152
public-kernel-tail 1,082 (+1%) 23,636 (+2%) 179,059 (-1%) 395,386 7,522 10,656 266 8,388,608
private-kernel-reset-small 590 (+1%) ⚠️ 1,989 (-16%) 51,683 (+2%) 120,733 64,614 89,536 2,731 2,097,152
merge-rollup 28.8 (+1%) N/A N/A 16,534 756 N/A N/A N/A
public-kernel-setup 628 (+1%) N/A N/A 104,941 86,302 N/A N/A N/A
public-kernel-teardown 535 (+1%) N/A N/A 104,941 86,302 N/A N/A N/A
private-kernel-tail-to-public N/A 8,404 (+3%) 106,590 (+1%) N/A N/A 114,784 3,520 4,194,304

Stats on running time collected for app circuits

Function input_size_in_bytes output_size_in_bytes witness_generation_time_in_ms proof_size_in_bytes proving_time_in_ms size_in_gates num_public_inputs
ContractClassRegisterer:register 1,344 9,944 460 N/A N/A N/A N/A
ContractInstanceDeployer:deploy 1,408 9,944 41.2 N/A N/A N/A N/A
MultiCallEntrypoint:entrypoint 1,920 9,944 1,411 (+1%) N/A N/A N/A N/A
SchnorrAccount:constructor 1,312 9,944 972 (+1%) N/A N/A N/A N/A
SchnorrAccount:entrypoint 2,304 9,944 2,075 (+1%) 16,768 55,898 (+3%) 2,097,152 457
Token:privately_mint_private_note 1,280 9,944 1,091 (+2%) N/A N/A N/A N/A
Token:transfer 1,376 9,944 3,999 (+2%) 16,768 60,303 2,097,152 457
Benchmarking:create_note 1,312 9,944 950 (+1%) N/A N/A N/A N/A
FPC:fee_entrypoint_public 1,344 9,944 223 N/A N/A N/A N/A
SchnorrAccount:spend_private_authwit 1,280 9,944 78.0 (-9%) N/A N/A N/A N/A
Token:unshield 1,376 9,944 3,257 (+1%) N/A N/A N/A N/A
FPC:fee_entrypoint_private 1,376 9,944 4,031 (+1%) N/A N/A N/A N/A

Tree insertion stats

The duration to insert a fixed batch of leaves into each tree type.

Metric 1 leaves 16 leaves 64 leaves 128 leaves 512 leaves 1024 leaves 2048 leaves 4096 leaves 32 leaves
batch_insert_into_append_only_tree_16_depth_ms 10.4 (+1%) 17.0 (+1%) N/A N/A N/A N/A N/A N/A N/A
batch_insert_into_append_only_tree_16_depth_hash_count 16.7 31.8 N/A N/A N/A N/A N/A N/A N/A
batch_insert_into_append_only_tree_16_depth_hash_ms 0.604 (+1%) 0.519 (+1%) N/A N/A N/A N/A N/A N/A N/A
batch_insert_into_append_only_tree_32_depth_ms N/A N/A 48.5 (+2%) 76.2 244 (-2%) 479 (+3%) 922 (+2%) 1,825 (-1%) N/A
batch_insert_into_append_only_tree_32_depth_hash_count N/A N/A 95.9 159 543 1,055 2,079 4,127 N/A
batch_insert_into_append_only_tree_32_depth_hash_ms N/A N/A 0.495 (+1%) 0.469 0.443 (-2%) 0.446 (+2%) 0.437 (+2%) 0.436 (-1%) N/A
batch_insert_into_indexed_tree_20_depth_ms N/A N/A 58.5 (+2%) 112 (+1%) 353 (-1%) 705 (+3%) 1,374 (+2%) 2,737 (+1%) N/A
batch_insert_into_indexed_tree_20_depth_hash_count N/A N/A 106 208 692 1,363 2,707 5,395 N/A
batch_insert_into_indexed_tree_20_depth_hash_ms N/A N/A 0.506 (+2%) 0.503 (+1%) 0.477 0.483 (+2%) 0.475 (+2%) 0.475 (+1%) N/A
batch_insert_into_indexed_tree_40_depth_ms N/A N/A N/A N/A N/A N/A N/A N/A 62.4 (+2%)
batch_insert_into_indexed_tree_40_depth_hash_count N/A N/A N/A N/A N/A N/A N/A N/A 107
batch_insert_into_indexed_tree_40_depth_hash_ms N/A N/A N/A N/A N/A N/A N/A N/A 0.553 (+1%)

Miscellaneous

Transaction sizes based on how many contract classes are registered in the tx.

Metric 0 registered classes 1 registered classes
tx_size_in_bytes 83,794 665,117

Transaction size based on fee payment method

| Metric | |
| - | |

@sirasistant sirasistant enabled auto-merge (squash) May 27, 2024 16:40
@sirasistant sirasistant merged commit 3502ccd into master May 27, 2024
86 checks passed
@sirasistant sirasistant deleted the arv/reduce_stack branch May 27, 2024 17:39
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants