mirror of
https://github.com/sigp/lighthouse.git
synced 2026-03-15 19:02:42 +00:00
## Overview
This rather extensive PR achieves two primary goals:
1. Uses the finalized/justified checkpoints of fork choice (FC), rather than that of the head state.
2. Refactors fork choice, block production and block processing to `async` functions.
Additionally, it achieves:
- Concurrent forkchoice updates to the EL and cache pruning after a new head is selected.
- Concurrent "block packing" (attestations, etc) and execution payload retrieval during block production.
- Concurrent per-block-processing and execution payload verification during block processing.
- The `Arc`-ification of `SignedBeaconBlock` during block processing (it's never mutated, so why not?):
- I had to do this to deal with sending blocks into spawned tasks.
- Previously we were cloning the beacon block at least 2 times during each block processing, these clones are either removed or turned into cheaper `Arc` clones.
- We were also `Box`-ing and un-`Box`-ing beacon blocks as they moved throughout the networking crate. This is not a big deal, but it's nice to avoid shifting things between the stack and heap.
- Avoids cloning *all the blocks* in *every chain segment* during sync.
- It also has the potential to clean up our code where we need to pass an *owned* block around so we can send it back in the case of an error (I didn't do much of this, my PR is already big enough 😅)
- The `BeaconChain::HeadSafetyStatus` struct was removed. It was an old relic from prior merge specs.
For motivation for this change, see https://github.com/sigp/lighthouse/pull/3244#issuecomment-1160963273
## Changes to `canonical_head` and `fork_choice`
Previously, the `BeaconChain` had two separate fields:
```
canonical_head: RwLock<Snapshot>,
fork_choice: RwLock<BeaconForkChoice>
```
Now, we have grouped these values under a single struct:
```
canonical_head: CanonicalHead {
cached_head: RwLock<Arc<Snapshot>>,
fork_choice: RwLock<BeaconForkChoice>
}
```
Apart from ergonomics, the only *actual* change here is wrapping the canonical head snapshot in an `Arc`. This means that we no longer need to hold the `cached_head` (`canonical_head`, in old terms) lock when we want to pull some values from it. This was done to avoid deadlock risks by preventing functions from acquiring (and holding) the `cached_head` and `fork_choice` locks simultaneously.
## Breaking Changes
### The `state` (root) field in the `finalized_checkpoint` SSE event
Consider the scenario where epoch `n` is just finalized, but `start_slot(n)` is skipped. There are two state roots we might in the `finalized_checkpoint` SSE event:
1. The state root of the finalized block, which is `get_block(finalized_checkpoint.root).state_root`.
4. The state root at slot of `start_slot(n)`, which would be the state from (1), but "skipped forward" through any skip slots.
Previously, Lighthouse would choose (2). However, we can see that when [Teku generates that event](de2b2801c8/data/beaconrestapi/src/main/java/tech/pegasys/teku/beaconrestapi/handlers/v1/events/EventSubscriptionManager.java (L171-L182)) it uses [`getStateRootFromBlockRoot`](de2b2801c8/data/provider/src/main/java/tech/pegasys/teku/api/ChainDataProvider.java (L336-L341)) which uses (1).
I have switched Lighthouse from (2) to (1). I think it's a somewhat arbitrary choice between the two, where (1) is easier to compute and is consistent with Teku.
## Notes for Reviewers
I've renamed `BeaconChain::fork_choice` to `BeaconChain::recompute_head`. Doing this helped ensure I broke all previous uses of fork choice and I also find it more descriptive. It describes an action and can't be confused with trying to get a reference to the `ForkChoice` struct.
I've changed the ordering of SSE events when a block is received. It used to be `[block, finalized, head]` and now it's `[block, head, finalized]`. It was easier this way and I don't think we were making any promises about SSE event ordering so it's not "breaking".
I've made it so fork choice will run when it's first constructed. I did this because I wanted to have a cached version of the last call to `get_head`. Ensuring `get_head` has been run *at least once* means that the cached values doesn't need to wrapped in an `Option`. This was fairly simple, it just involved passing a `slot` to the constructor so it knows *when* it's being run. When loading a fork choice from the store and a slot clock isn't handy I've just used the `slot` that was saved in the `fork_choice_store`. That seems like it would be a faithful representation of the slot when we saved it.
I added the `genesis_time: u64` to the `BeaconChain`. It's small, constant and nice to have around.
Since we're using FC for the fin/just checkpoints, we no longer get the `0x00..00` roots at genesis. You can see I had to remove a work-around in `ef-tests` here: b56be3bc2. I can't find any reason why this would be an issue, if anything I think it'll be better since the genesis-alias has caught us out a few times (0x00..00 isn't actually a real root). Edit: I did find a case where the `network` expected the 0x00..00 alias and patched it here: 3f26ac3e2.
You'll notice a lot of changes in tests. Generally, tests should be functionally equivalent. Here are the things creating the most diff-noise in tests:
- Changing tests to be `tokio::async` tests.
- Adding `.await` to fork choice, block processing and block production functions.
- Refactor of the `canonical_head` "API" provided by the `BeaconChain`. E.g., `chain.canonical_head.cached_head()` instead of `chain.canonical_head.read()`.
- Wrapping `SignedBeaconBlock` in an `Arc`.
- In the `beacon_chain/tests/block_verification`, we can't use the `lazy_static` `CHAIN_SEGMENT` variable anymore since it's generated with an async function. We just generate it in each test, not so efficient but hopefully insignificant.
I had to disable `rayon` concurrent tests in the `fork_choice` tests. This is because the use of `rayon` and `block_on` was causing a panic.
Co-authored-by: Mac L <mjladson@pm.me>
436 lines
20 KiB
Rust
436 lines
20 KiB
Rust
use beacon_chain::{
|
|
attestation_verification::Error as AttnError,
|
|
sync_committee_verification::Error as SyncCommitteeError,
|
|
};
|
|
use fnv::FnvHashMap;
|
|
pub use lighthouse_metrics::*;
|
|
use lighthouse_network::{
|
|
peer_manager::peerdb::client::ClientKind, types::GossipKind, BandwidthSinks, GossipTopic,
|
|
Gossipsub, NetworkGlobals,
|
|
};
|
|
use std::sync::Arc;
|
|
use strum::IntoEnumIterator;
|
|
use types::EthSpec;
|
|
|
|
lazy_static! {
|
|
|
|
pub static ref BEACON_BLOCK_MESH_PEERS_PER_CLIENT: Result<IntGaugeVec> =
|
|
try_create_int_gauge_vec(
|
|
"block_mesh_peers_per_client",
|
|
"Number of mesh peers for BeaconBlock topic per client",
|
|
&["Client"]
|
|
);
|
|
|
|
pub static ref BEACON_AGGREGATE_AND_PROOF_MESH_PEERS_PER_CLIENT: Result<IntGaugeVec> =
|
|
try_create_int_gauge_vec(
|
|
"beacon_aggregate_and_proof_mesh_peers_per_client",
|
|
"Number of mesh peers for BeaconAggregateAndProof topic per client",
|
|
&["Client"]
|
|
);
|
|
|
|
/*
|
|
* Attestation subnet subscriptions
|
|
*/
|
|
pub static ref SUBNET_SUBSCRIPTION_REQUESTS: Result<IntCounter> = try_create_int_counter(
|
|
"validator_attestation_subnet_subscriptions_total",
|
|
"Count of validator attestation subscription requests."
|
|
);
|
|
pub static ref SUBNET_SUBSCRIPTION_AGGREGATOR_REQUESTS: Result<IntCounter> = try_create_int_counter(
|
|
"validator_subnet_subscriptions_aggregator_total",
|
|
"Count of validator subscription requests where the subscriber is an aggregator."
|
|
);
|
|
pub static ref SYNC_COMMITTEE_SUBSCRIPTION_REQUESTS: Result<IntCounter> = try_create_int_counter(
|
|
"validator_sync_committee_subnet_subscriptions_total",
|
|
"Count of validator sync committee subscription requests."
|
|
);
|
|
|
|
/*
|
|
* Gossip processor
|
|
*/
|
|
pub static ref BEACON_PROCESSOR_WORK_EVENTS_RX_COUNT: Result<IntCounterVec> = try_create_int_counter_vec(
|
|
"beacon_processor_work_events_rx_count",
|
|
"Count of work events received (but not necessarily processed)",
|
|
&["type"]
|
|
);
|
|
pub static ref BEACON_PROCESSOR_WORK_EVENTS_IGNORED_COUNT: Result<IntCounterVec> = try_create_int_counter_vec(
|
|
"beacon_processor_work_events_ignored_count",
|
|
"Count of work events purposefully ignored",
|
|
&["type"]
|
|
);
|
|
pub static ref BEACON_PROCESSOR_WORK_EVENTS_STARTED_COUNT: Result<IntCounterVec> = try_create_int_counter_vec(
|
|
"beacon_processor_work_events_started_count",
|
|
"Count of work events which have been started by a worker",
|
|
&["type"]
|
|
);
|
|
pub static ref BEACON_PROCESSOR_WORKER_TIME: Result<HistogramVec> = try_create_histogram_vec(
|
|
"beacon_processor_worker_time",
|
|
"Time taken for a worker to fully process some parcel of work.",
|
|
&["type"]
|
|
);
|
|
pub static ref BEACON_PROCESSOR_WORKERS_SPAWNED_TOTAL: Result<IntCounter> = try_create_int_counter(
|
|
"beacon_processor_workers_spawned_total",
|
|
"The number of workers ever spawned by the gossip processing pool."
|
|
);
|
|
pub static ref BEACON_PROCESSOR_WORKERS_ACTIVE_TOTAL: Result<IntGauge> = try_create_int_gauge(
|
|
"beacon_processor_workers_active_total",
|
|
"Count of active workers in the gossip processing pool."
|
|
);
|
|
pub static ref BEACON_PROCESSOR_IDLE_EVENTS_TOTAL: Result<IntCounter> = try_create_int_counter(
|
|
"beacon_processor_idle_events_total",
|
|
"Count of idle events processed by the gossip processor manager."
|
|
);
|
|
pub static ref BEACON_PROCESSOR_EVENT_HANDLING_SECONDS: Result<Histogram> = try_create_histogram(
|
|
"beacon_processor_event_handling_seconds",
|
|
"Time spent handling a new message and allocating it to a queue or worker."
|
|
);
|
|
// Gossip blocks.
|
|
pub static ref BEACON_PROCESSOR_GOSSIP_BLOCK_QUEUE_TOTAL: Result<IntGauge> = try_create_int_gauge(
|
|
"beacon_processor_gossip_block_queue_total",
|
|
"Count of blocks from gossip waiting to be verified."
|
|
);
|
|
pub static ref BEACON_PROCESSOR_GOSSIP_BLOCK_VERIFIED_TOTAL: Result<IntCounter> = try_create_int_counter(
|
|
"beacon_processor_gossip_block_verified_total",
|
|
"Total number of gossip blocks verified for propagation."
|
|
);
|
|
pub static ref BEACON_PROCESSOR_GOSSIP_BLOCK_IMPORTED_TOTAL: Result<IntCounter> = try_create_int_counter(
|
|
"beacon_processor_gossip_block_imported_total",
|
|
"Total number of gossip blocks imported to fork choice, etc."
|
|
);
|
|
pub static ref BEACON_PROCESSOR_GOSSIP_BLOCK_REQUEUED_TOTAL: Result<IntCounter> = try_create_int_counter(
|
|
"beacon_processor_gossip_block_requeued_total",
|
|
"Total number of gossip blocks that arrived early and were re-queued for later processing."
|
|
);
|
|
pub static ref BEACON_PROCESSOR_GOSSIP_BLOCK_EARLY_SECONDS: Result<Histogram> = try_create_histogram(
|
|
"beacon_processor_gossip_block_early_seconds",
|
|
"Whenever a gossip block is received early this metrics is set to how early that block was."
|
|
);
|
|
// Gossip Exits.
|
|
pub static ref BEACON_PROCESSOR_EXIT_QUEUE_TOTAL: Result<IntGauge> = try_create_int_gauge(
|
|
"beacon_processor_exit_queue_total",
|
|
"Count of exits from gossip waiting to be verified."
|
|
);
|
|
pub static ref BEACON_PROCESSOR_EXIT_VERIFIED_TOTAL: Result<IntCounter> = try_create_int_counter(
|
|
"beacon_processor_exit_verified_total",
|
|
"Total number of voluntary exits verified for propagation."
|
|
);
|
|
pub static ref BEACON_PROCESSOR_EXIT_IMPORTED_TOTAL: Result<IntCounter> = try_create_int_counter(
|
|
"beacon_processor_exit_imported_total",
|
|
"Total number of voluntary exits imported to the op pool."
|
|
);
|
|
// Gossip proposer slashings.
|
|
pub static ref BEACON_PROCESSOR_PROPOSER_SLASHING_QUEUE_TOTAL: Result<IntGauge> = try_create_int_gauge(
|
|
"beacon_processor_proposer_slashing_queue_total",
|
|
"Count of proposer slashings from gossip waiting to be verified."
|
|
);
|
|
pub static ref BEACON_PROCESSOR_PROPOSER_SLASHING_VERIFIED_TOTAL: Result<IntCounter> = try_create_int_counter(
|
|
"beacon_processor_proposer_slashing_verified_total",
|
|
"Total number of proposer slashings verified for propagation."
|
|
);
|
|
pub static ref BEACON_PROCESSOR_PROPOSER_SLASHING_IMPORTED_TOTAL: Result<IntCounter> = try_create_int_counter(
|
|
"beacon_processor_proposer_slashing_imported_total",
|
|
"Total number of proposer slashings imported to the op pool."
|
|
);
|
|
// Gossip attester slashings.
|
|
pub static ref BEACON_PROCESSOR_ATTESTER_SLASHING_QUEUE_TOTAL: Result<IntGauge> = try_create_int_gauge(
|
|
"beacon_processor_attester_slashing_queue_total",
|
|
"Count of attester slashings from gossip waiting to be verified."
|
|
);
|
|
pub static ref BEACON_PROCESSOR_ATTESTER_SLASHING_VERIFIED_TOTAL: Result<IntCounter> = try_create_int_counter(
|
|
"beacon_processor_attester_slashing_verified_total",
|
|
"Total number of attester slashings verified for propagation."
|
|
);
|
|
pub static ref BEACON_PROCESSOR_ATTESTER_SLASHING_IMPORTED_TOTAL: Result<IntCounter> = try_create_int_counter(
|
|
"beacon_processor_attester_slashing_imported_total",
|
|
"Total number of attester slashings imported to the op pool."
|
|
);
|
|
// Rpc blocks.
|
|
pub static ref BEACON_PROCESSOR_RPC_BLOCK_QUEUE_TOTAL: Result<IntGauge> = try_create_int_gauge(
|
|
"beacon_processor_rpc_block_queue_total",
|
|
"Count of blocks from the rpc waiting to be verified."
|
|
);
|
|
pub static ref BEACON_PROCESSOR_RPC_BLOCK_IMPORTED_TOTAL: Result<IntCounter> = try_create_int_counter(
|
|
"beacon_processor_rpc_block_imported_total",
|
|
"Total number of gossip blocks imported to fork choice, etc."
|
|
);
|
|
// Chain segments.
|
|
pub static ref BEACON_PROCESSOR_CHAIN_SEGMENT_QUEUE_TOTAL: Result<IntGauge> = try_create_int_gauge(
|
|
"beacon_processor_chain_segment_queue_total",
|
|
"Count of chain segments from the rpc waiting to be verified."
|
|
);
|
|
pub static ref BEACON_PROCESSOR_BACKFILL_CHAIN_SEGMENT_QUEUE_TOTAL: Result<IntGauge> = try_create_int_gauge(
|
|
"beacon_processor_backfill_chain_segment_queue_total",
|
|
"Count of backfill chain segments from the rpc waiting to be verified."
|
|
);
|
|
pub static ref BEACON_PROCESSOR_CHAIN_SEGMENT_SUCCESS_TOTAL: Result<IntCounter> = try_create_int_counter(
|
|
"beacon_processor_chain_segment_success_total",
|
|
"Total number of chain segments successfully processed."
|
|
);
|
|
pub static ref BEACON_PROCESSOR_BACKFILL_CHAIN_SEGMENT_SUCCESS_TOTAL: Result<IntCounter> = try_create_int_counter(
|
|
"beacon_processor_backfill_chain_segment_success_total",
|
|
"Total number of chain segments successfully processed."
|
|
);
|
|
pub static ref BEACON_PROCESSOR_CHAIN_SEGMENT_FAILED_TOTAL: Result<IntCounter> = try_create_int_counter(
|
|
"beacon_processor_chain_segment_failed_total",
|
|
"Total number of chain segments that failed processing."
|
|
);
|
|
pub static ref BEACON_PROCESSOR_BACKFILL_CHAIN_SEGMENT_FAILED_TOTAL: Result<IntCounter> = try_create_int_counter(
|
|
"beacon_processor_backfill_chain_segment_failed_total",
|
|
"Total number of backfill chain segments that failed processing."
|
|
);
|
|
// Unaggregated attestations.
|
|
pub static ref BEACON_PROCESSOR_UNAGGREGATED_ATTESTATION_QUEUE_TOTAL: Result<IntGauge> = try_create_int_gauge(
|
|
"beacon_processor_unaggregated_attestation_queue_total",
|
|
"Count of unagg. attestations waiting to be processed."
|
|
);
|
|
pub static ref BEACON_PROCESSOR_UNAGGREGATED_ATTESTATION_VERIFIED_TOTAL: Result<IntCounter> = try_create_int_counter(
|
|
"beacon_processor_unaggregated_attestation_verified_total",
|
|
"Total number of unaggregated attestations verified for gossip."
|
|
);
|
|
pub static ref BEACON_PROCESSOR_UNAGGREGATED_ATTESTATION_IMPORTED_TOTAL: Result<IntCounter> = try_create_int_counter(
|
|
"beacon_processor_unaggregated_attestation_imported_total",
|
|
"Total number of unaggregated attestations imported to fork choice, etc."
|
|
);
|
|
pub static ref BEACON_PROCESSOR_UNAGGREGATED_ATTESTATION_REQUEUED_TOTAL: Result<IntCounter> = try_create_int_counter(
|
|
"beacon_processor_unaggregated_attestation_requeued_total",
|
|
"Total number of unaggregated attestations that referenced an unknown block and were re-queued."
|
|
);
|
|
// Aggregated attestations.
|
|
pub static ref BEACON_PROCESSOR_AGGREGATED_ATTESTATION_QUEUE_TOTAL: Result<IntGauge> = try_create_int_gauge(
|
|
"beacon_processor_aggregated_attestation_queue_total",
|
|
"Count of agg. attestations waiting to be processed."
|
|
);
|
|
pub static ref BEACON_PROCESSOR_AGGREGATED_ATTESTATION_VERIFIED_TOTAL: Result<IntCounter> = try_create_int_counter(
|
|
"beacon_processor_aggregated_attestation_verified_total",
|
|
"Total number of aggregated attestations verified for gossip."
|
|
);
|
|
pub static ref BEACON_PROCESSOR_AGGREGATED_ATTESTATION_IMPORTED_TOTAL: Result<IntCounter> = try_create_int_counter(
|
|
"beacon_processor_aggregated_attestation_imported_total",
|
|
"Total number of aggregated attestations imported to fork choice, etc."
|
|
);
|
|
pub static ref BEACON_PROCESSOR_AGGREGATED_ATTESTATION_REQUEUED_TOTAL: Result<IntCounter> = try_create_int_counter(
|
|
"beacon_processor_aggregated_attestation_requeued_total",
|
|
"Total number of aggregated attestations that referenced an unknown block and were re-queued."
|
|
);
|
|
// Sync committee messages.
|
|
pub static ref BEACON_PROCESSOR_SYNC_MESSAGE_QUEUE_TOTAL: Result<IntGauge> = try_create_int_gauge(
|
|
"beacon_processor_sync_message_queue_total",
|
|
"Count of sync committee messages waiting to be processed."
|
|
);
|
|
pub static ref BEACON_PROCESSOR_SYNC_MESSAGE_VERIFIED_TOTAL: Result<IntCounter> = try_create_int_counter(
|
|
"beacon_processor_sync_message_verified_total",
|
|
"Total number of sync committee messages verified for gossip."
|
|
);
|
|
pub static ref BEACON_PROCESSOR_SYNC_MESSAGE_IMPORTED_TOTAL: Result<IntCounter> = try_create_int_counter(
|
|
"beacon_processor_sync_message_imported_total",
|
|
"Total number of sync committee messages imported to fork choice, etc."
|
|
);
|
|
// Sync contribution.
|
|
pub static ref BEACON_PROCESSOR_SYNC_CONTRIBUTION_QUEUE_TOTAL: Result<IntGauge> = try_create_int_gauge(
|
|
"beacon_processor_sync_contribution_queue_total",
|
|
"Count of sync committee contributions waiting to be processed."
|
|
);
|
|
pub static ref BEACON_PROCESSOR_SYNC_CONTRIBUTION_VERIFIED_TOTAL: Result<IntCounter> = try_create_int_counter(
|
|
"beacon_processor_sync_contribution_verified_total",
|
|
"Total number of sync committee contributions verified for gossip."
|
|
);
|
|
|
|
pub static ref BEACON_PROCESSOR_SYNC_CONTRIBUTION_IMPORTED_TOTAL: Result<IntCounter> = try_create_int_counter(
|
|
"beacon_processor_sync_contribution_imported_total",
|
|
"Total number of sync committee contributions imported to fork choice, etc."
|
|
);
|
|
|
|
/// Errors and Debugging Stats
|
|
pub static ref GOSSIP_ATTESTATION_ERRORS_PER_TYPE: Result<IntCounterVec> =
|
|
try_create_int_counter_vec(
|
|
"gossipsub_attestation_errors_per_type",
|
|
"Gossipsub attestation errors per error type",
|
|
&["type"]
|
|
);
|
|
pub static ref GOSSIP_SYNC_COMMITTEE_ERRORS_PER_TYPE: Result<IntCounterVec> =
|
|
try_create_int_counter_vec(
|
|
"gossipsub_sync_committee_errors_per_type",
|
|
"Gossipsub sync_committee errors per error type",
|
|
&["type"]
|
|
);
|
|
}
|
|
|
|
lazy_static! {
|
|
|
|
/*
|
|
* Bandwidth metrics
|
|
*/
|
|
pub static ref INBOUND_LIBP2P_BYTES: Result<IntGauge> =
|
|
try_create_int_gauge("libp2p_inbound_bytes", "The inbound bandwidth over libp2p");
|
|
|
|
pub static ref OUTBOUND_LIBP2P_BYTES: Result<IntGauge> = try_create_int_gauge(
|
|
"libp2p_outbound_bytes",
|
|
"The outbound bandwidth over libp2p"
|
|
);
|
|
pub static ref TOTAL_LIBP2P_BANDWIDTH: Result<IntGauge> = try_create_int_gauge(
|
|
"libp2p_total_bandwidth",
|
|
"The total inbound/outbound bandwidth over libp2p"
|
|
);
|
|
|
|
|
|
/*
|
|
* Sync related metrics
|
|
*/
|
|
pub static ref PEERS_PER_SYNC_TYPE: Result<IntGaugeVec> = try_create_int_gauge_vec(
|
|
"sync_peers_per_status",
|
|
"Number of connected peers per sync status type",
|
|
&["sync_status"]
|
|
);
|
|
pub static ref SYNCING_CHAINS_COUNT: Result<IntGaugeVec> = try_create_int_gauge_vec(
|
|
"sync_range_chains",
|
|
"Number of Syncing chains in range, per range type",
|
|
&["range_type"]
|
|
);
|
|
pub static ref SYNC_SINGLE_BLOCK_LOOKUPS: Result<IntGauge> = try_create_int_gauge(
|
|
"sync_single_block_lookups",
|
|
"Number of single block lookups underway"
|
|
);
|
|
pub static ref SYNC_PARENT_BLOCK_LOOKUPS: Result<IntGauge> = try_create_int_gauge(
|
|
"sync_parent_block_lookups",
|
|
"Number of parent block lookups underway"
|
|
);
|
|
|
|
/*
|
|
* Block Delay Metrics
|
|
*/
|
|
pub static ref BEACON_BLOCK_GOSSIP_PROPAGATION_VERIFICATION_DELAY_TIME: Result<Histogram> = try_create_histogram(
|
|
"beacon_block_gossip_propagation_verification_delay_time",
|
|
"Duration between when the block is received and when it is verified for propagation.",
|
|
);
|
|
pub static ref BEACON_BLOCK_GOSSIP_SLOT_START_DELAY_TIME: Result<Histogram> = try_create_histogram(
|
|
"beacon_block_gossip_slot_start_delay_time",
|
|
"Duration between when the block is received and the start of the slot it belongs to.",
|
|
);
|
|
pub static ref BEACON_BLOCK_GOSSIP_ARRIVED_LATE_TOTAL: Result<IntCounter> = try_create_int_counter(
|
|
"beacon_block_gossip_arrived_late_total",
|
|
"Count of times when a gossip block arrived from the network later than the attestation deadline.",
|
|
);
|
|
|
|
/*
|
|
* Attestation reprocessing queue metrics.
|
|
*/
|
|
pub static ref BEACON_PROCESSOR_REPROCESSING_QUEUE_TOTAL: Result<IntGaugeVec> =
|
|
try_create_int_gauge_vec(
|
|
"beacon_processor_reprocessing_queue_total",
|
|
"Count of items in a reprocessing queue.",
|
|
&["type"]
|
|
);
|
|
pub static ref BEACON_PROCESSOR_REPROCESSING_QUEUE_EXPIRED_ATTESTATIONS: Result<IntCounter> = try_create_int_counter(
|
|
"beacon_processor_reprocessing_queue_expired_attestations",
|
|
"Number of queued attestations which have expired before a matching block has been found."
|
|
);
|
|
pub static ref BEACON_PROCESSOR_REPROCESSING_QUEUE_MATCHED_ATTESTATIONS: Result<IntCounter> = try_create_int_counter(
|
|
"beacon_processor_reprocessing_queue_matched_attestations",
|
|
"Number of queued attestations where as matching block has been imported."
|
|
);
|
|
|
|
}
|
|
|
|
pub fn update_bandwidth_metrics(bandwidth: Arc<BandwidthSinks>) {
|
|
set_gauge(&INBOUND_LIBP2P_BYTES, bandwidth.total_inbound() as i64);
|
|
set_gauge(&OUTBOUND_LIBP2P_BYTES, bandwidth.total_outbound() as i64);
|
|
set_gauge(
|
|
&TOTAL_LIBP2P_BANDWIDTH,
|
|
(bandwidth.total_inbound() + bandwidth.total_outbound()) as i64,
|
|
);
|
|
}
|
|
|
|
pub fn register_attestation_error(error: &AttnError) {
|
|
inc_counter_vec(&GOSSIP_ATTESTATION_ERRORS_PER_TYPE, &[error.as_ref()]);
|
|
}
|
|
|
|
pub fn register_sync_committee_error(error: &SyncCommitteeError) {
|
|
inc_counter_vec(&GOSSIP_SYNC_COMMITTEE_ERRORS_PER_TYPE, &[error.as_ref()]);
|
|
}
|
|
|
|
pub fn update_gossip_metrics<T: EthSpec>(
|
|
gossipsub: &Gossipsub,
|
|
network_globals: &Arc<NetworkGlobals<T>>,
|
|
) {
|
|
// Mesh peers per client
|
|
// Reset the gauges
|
|
for client_kind in ClientKind::iter() {
|
|
set_gauge_vec(
|
|
&BEACON_BLOCK_MESH_PEERS_PER_CLIENT,
|
|
&[client_kind.as_ref()],
|
|
0_i64,
|
|
);
|
|
set_gauge_vec(
|
|
&BEACON_AGGREGATE_AND_PROOF_MESH_PEERS_PER_CLIENT,
|
|
&[client_kind.as_ref()],
|
|
0_i64,
|
|
);
|
|
}
|
|
|
|
for topic_hash in gossipsub.topics() {
|
|
if let Ok(topic) = GossipTopic::decode(topic_hash.as_str()) {
|
|
match topic.kind() {
|
|
GossipKind::Attestation(_subnet_id) => {}
|
|
GossipKind::BeaconBlock => {
|
|
for peer_id in gossipsub.mesh_peers(topic_hash) {
|
|
let client = network_globals
|
|
.peers
|
|
.read()
|
|
.peer_info(peer_id)
|
|
.map(|peer_info| peer_info.client().kind.into())
|
|
.unwrap_or_else(|| "Unknown");
|
|
if let Some(v) =
|
|
get_int_gauge(&BEACON_BLOCK_MESH_PEERS_PER_CLIENT, &[client])
|
|
{
|
|
v.inc()
|
|
};
|
|
}
|
|
}
|
|
GossipKind::BeaconAggregateAndProof => {
|
|
for peer_id in gossipsub.mesh_peers(topic_hash) {
|
|
let client = network_globals
|
|
.peers
|
|
.read()
|
|
.peer_info(peer_id)
|
|
.map(|peer_info| peer_info.client().kind.into())
|
|
.unwrap_or_else(|| "Unknown");
|
|
if let Some(v) = get_int_gauge(
|
|
&BEACON_AGGREGATE_AND_PROOF_MESH_PEERS_PER_CLIENT,
|
|
&[client],
|
|
) {
|
|
v.inc()
|
|
};
|
|
}
|
|
}
|
|
GossipKind::SyncCommitteeMessage(_subnet_id) => {}
|
|
_kind => {}
|
|
}
|
|
}
|
|
}
|
|
}
|
|
|
|
pub fn update_sync_metrics<T: EthSpec>(network_globals: &Arc<NetworkGlobals<T>>) {
|
|
// reset the counts
|
|
if PEERS_PER_SYNC_TYPE
|
|
.as_ref()
|
|
.map(|metric| metric.reset())
|
|
.is_err()
|
|
{
|
|
return;
|
|
};
|
|
|
|
// count per sync status, the number of connected peers
|
|
let mut peers_per_sync_type = FnvHashMap::default();
|
|
for sync_type in network_globals
|
|
.peers
|
|
.read()
|
|
.connected_peers()
|
|
.map(|(_peer_id, info)| info.sync_status().as_str())
|
|
{
|
|
*peers_per_sync_type.entry(sync_type).or_default() += 1;
|
|
}
|
|
|
|
for (sync_type, peer_count) in peers_per_sync_type {
|
|
set_gauge_entry(&PEERS_PER_SYNC_TYPE, &[sync_type], peer_count);
|
|
}
|
|
}
|