lighthouse

mirror of https://github.com/sigp/lighthouse.git synced 2026-04-30 03:03:45 +00:00

Author	SHA1	Message	Date
Paul Hauner	03b984aa89	Add extra_data field	2021-09-29 14:40:20 +10:00
Paul Hauner	7091adf58c	Integrate execute_payload	2021-09-29 14:40:18 +10:00
Paul Hauner	1c2b59f851	Add block_on to execution_layer	2021-09-29 14:38:28 +10:00
Paul Hauner	203a93b3e1	Add block processing methods to ExecutionLayer	2021-09-29 14:38:28 +10:00
Paul Hauner	f698b91d77	Add CLI flags	2021-09-29 14:38:27 +10:00
Paul Hauner	81a62e33d7	Thread execution layer into ClientBuilder	2021-09-29 14:38:27 +10:00
Paul Hauner	95ef497e7b	Fix clippy lints	2021-09-29 14:38:27 +10:00
Paul Hauner	4fe318c2e5	Begin threading execution layer into BeaconChain	2021-09-29 14:38:27 +10:00
Paul Hauner	74a25cebdb	Finish adding tests	2021-09-29 14:38:27 +10:00
Paul Hauner	68e24d4cc1	Fix camelCase	2021-09-29 14:38:27 +10:00
Paul Hauner	9e7b4327f1	Add first test	2021-09-29 14:38:26 +10:00
Paul Hauner	31ad3239d4	Switch to new rpc sending method	2021-09-29 14:38:26 +10:00
Paul Hauner	95e9407cd9	Finish custom JSON response handler	2021-09-29 14:38:26 +10:00
Paul Hauner	cb5e33d53c	Start adding json rpc wrapper	2021-09-29 14:38:26 +10:00
Paul Hauner	08308c0000	Add all minimal spec endpoints	2021-09-29 14:38:25 +10:00
Paul Hauner	3d2bc6db9e	Add executePayload	2021-09-29 14:38:25 +10:00
Paul Hauner	ac1cdc5ca4	Modify decoding	2021-09-29 14:38:25 +10:00
Paul Hauner	7433385fb3	Add bones of execution_layer	2021-09-29 14:38:25 +10:00
Paul Hauner	1ce8339d96	Make eth1::http functions pub	2021-09-29 14:38:25 +10:00
ethDreamer	0a0deb73e3	Finished Gossip Block Validation Conditions (#2640 ) * Gossip Block Validation is Much More Efficient Co-authored-by: realbigsean <seananderson33@gmail.com>	2021-09-28 18:36:03 -05:00
ethDreamer	29097d3dae	Fork boundary fix (#2646 ) * Fixed Gossip Topics on Fork Boundary	2021-09-28 18:09:08 -05:00
realbigsean	e559bd9f59	Store execution block hash in fork choice (#2643 ) * - Update the fork choice `ProtoNode` to include `is_merge_complete` - Add database migration for the persisted fork choice * update tests * Small cleanup * lints * store execution block hash in fork choice rather than bool	2021-09-29 08:50:51 +10:00
Paul Hauner	b48f133a8c	Fix clippy lints on merge-f2f (#2626 ) * Remove unchecked arith from ssz_derive * Address clippy lints in block_verfication * Use safe math for is_valid_gas_limit	2021-09-29 08:50:50 +10:00
Michael Sproul	ef6158f4ee	Fix consensus, SSZ, tree hash & run merge EF tests (#2622 ) * Update to v1.1.0-beta.4 (squash of #2548) * SSZ, cached tree hash, EF tests	2021-09-29 08:50:50 +10:00
Mark Mackey	3718c36c51	Initial merge changes Added Execution Payload from Rayonism Fork Updated new Containers to match Merge Spec Updated BeaconBlockBody for Merge Spec Completed updating BeaconState and BeaconBlockBody Modified ExecutionPayload<T> to use Transaction<T> Mostly Finished Changes for beacon-chain.md Added some things for fork-choice.md Update to match new fork-choice.md/fork.md changes ran cargo fmt Added Missing Pieces in eth2_libp2p for Merge fix ef test Various Changes to Conform Closer to Merge Spec	2021-09-29 08:50:48 +10:00
realbigsean	113ef74ef6	Add contribution and proof event (#2527 ) ## Issue Addressed N/A ## Proposed Changes Add the new ContributionAndProof event: https://github.com/ethereum/beacon-APIs/pull/158 ## Additional Info N/A Co-authored-by: realbigsean <seananderson33@gmail.com>	2021-09-25 07:53:58 +00:00
Paul Hauner	fe52322088	Implement SSZ union type (#2579 ) ## Issue Addressed NA ## Proposed Changes Implements the "union" type from the SSZ spec for `ssz`, `ssz_derive`, `tree_hash` and `tree_hash_derive` so it may be derived for `enums`: https://github.com/ethereum/consensus-specs/blob/v1.1.0-beta.3/ssz/simple-serialize.md#union The union type is required for the merge, since the `Transaction` type is defined as a single-variant union `Union[OpaqueTransaction]`. ### Crate Updates This PR will (hopefully) cause CI to publish new versions for the following crates: - `eth2_ssz_derive`: `0.2.1` -> `0.3.0` - `eth2_ssz`: `0.3.0` -> `0.4.0` - `eth2_ssz_types`: `0.2.0` -> `0.2.1` - `tree_hash`: `0.3.0` -> `0.4.0` - `tree_hash_derive`: `0.3.0` -> `0.4.0` These these crates depend on each other, I've had to add a workspace-level `[patch]` for these crates. A follow-up PR will need to remove this patch, ones the new versions are published. ### Union Behaviors We already had SSZ `Encode` and `TreeHash` derive for enums, however it just did a "transparent" pass-through of the inner value. Since the "union" decoding from the spec is in conflict with the transparent method, I've required that all `enum` have exactly one of the following enum-level attributes: #### SSZ - `#[ssz(enum_behaviour = "union")]` - matches the spec used for the merge - `#[ssz(enum_behaviour = "transparent")]` - maintains existing functionality - not supported for `Decode` (never was) #### TreeHash - `#[tree_hash(enum_behaviour = "union")]` - matches the spec used for the merge - `#[tree_hash(enum_behaviour = "transparent")]` - maintains existing functionality This means that we can maintain the existing transparent behaviour, but all existing users will get a compile-time error until they explicitly opt-in to being transparent. ### Legacy Option Encoding Before this PR, we already had a union-esque encoding for `Option<T>`. However, this was with the old SSZ spec where the union selector was 4 bytes. During merge specification, the spec was changed to use 1 byte for the selector. Whilst the 4-byte `Option` encoding was never used in the spec, we used it in our database. Writing a migrate script for all occurrences of `Option` in the database would be painful, especially since it's used in the `CommitteeCache`. To avoid the migrate script, I added a serde-esque `#[ssz(with = "module")]` field-level attribute to `ssz_derive` so that we can opt into the 4-byte encoding on a field-by-field basis. The `ssz::legacy::four_byte_impl!` macro allows a one-liner to define the module required for the `#[ssz(with = "module")]` for some `Option<T> where T: Encode + Decode`. Notably, I have removed `Encode` and `Decode` impls for `Option`. I've done this to force a break on downstream users. Like I mentioned, `Option` isn't used in the spec so I don't think it'll be that annoying. I think it's nicer than quietly having two different union implementations or quietly breaking the existing `Option` impl. ### Crate Publish Ordering I've modified the order in which CI publishes crates to ensure that we don't publish a crate without ensuring we already published a crate that it depends upon. ## TODO - [ ] Queue a follow-up `[patch]`-removing PR.	2021-09-25 05:58:36 +00:00
Age Manning	00a7ef0036	Correct bug in sync (#2615 ) A bug that causes failed batches to continually download in a loop is corrected.	2021-09-23 01:32:04 +00:00
Paul Hauner	be11437c27	Batch BLS verification for attestations (#2399 ) ## Issue Addressed NA ## Proposed Changes Adds the ability to verify batches of aggregated/unaggregated attestations from the network. When the `BeaconProcessor` finds there are messages in the aggregated or unaggregated attestation queues, it will first check the length of the queue: - `== 1` verify the attestation individually. - `>= 2` take up to 64 of those attestations and verify them in a batch. Notably, we only perform batch verification if the queue has a backlog. We don't apply any artificial delays to attestations to try and force them into batches. ### Batching Details To assist with implementing batches we modify `beacon_chain::attestation_verification` to have two distinct categories for attestations: - Indexed attestations: those which have passed initial validation and were valid enough for us to derive an `IndexedAttestation`. - Verified attestations: those attestations which were indexed and also passed signature verification. These are well-formed, interesting messages which were signed by validators. The batching functions accept `n` attestations and then return `n` attestation verification `Result`s, where those `Result`s can be any combination of `Ok` or `Err`. In other words, we attempt to verify as many attestations as possible and return specific per-attestation results so peer scores can be updated, if required. When we batch verify attestations, we first try to map all those attestations to indexed attestations. If any of those attestations were able to be indexed, we then perform batch BLS verification on those indexed attestations. If the batch verification succeeds, we convert them into verified attestations, disabling individual signature checking. If the batch fails, we convert to verified attestations with individual signature checking enabled. Ultimately, we optimistically try to do a batch verification of attestation signatures and fall-back to individual verification if it fails. This opens an attach vector for "poisoning" the attestations and causing us to waste a batch verification. I argue that peer scoring should do a good-enough job of defending against this and the typical-case gains massively outweigh the worst-case losses. ## Additional Info Before this PR, attestation verification took the attestations by value (instead of by reference). It turns out that this was unnecessary and, in my opinion, resulted in some undesirable ergonomics (e.g., we had to pass the attestation back in the `Err` variant to avoid clones). In this PR I've modified attestation verification so that it now takes a reference. I refactored the `beacon_chain/tests/attestation_verification.rs` tests so they use a builder-esque "tester" struct instead of a weird macro. It made it easier for me to test individual/batch with the same set of tests and I think it was a nice tidy-up. Notably, I did this last to try and make sure my new refactors to actual production code would pass under the existing test suite.	2021-09-22 08:49:41 +00:00
Michael Sproul	9667dc2f03	Implement checkpoint sync (#2244 ) ## Issue Addressed Closes #1891 Closes #1784 ## Proposed Changes Implement checkpoint sync for Lighthouse, enabling it to start from a weak subjectivity checkpoint. ## Additional Info - [x] Return unavailable status for out-of-range blocks requested by peers (#2561) - [x] Implement sync daemon for fetching historical blocks (#2561) - [x] Verify chain hashes (either in `historical_blocks.rs` or the calling module) - [x] Consistency check for initial block + state - [x] Fetch the initial state and block from a beacon node HTTP endpoint - [x] Don't crash fetching beacon states by slot from the API - [x] Background service for state reconstruction, triggered by CLI flag or API call. Considered out of scope for this PR: - Drop the requirement to provide the `--checkpoint-block` (this would require some pretty heavy refactoring of block verification) Co-authored-by: Diva M <divma@protonmail.com>	2021-09-22 00:37:28 +00:00
Age Manning	280e4fe23d	Increase connection limits and allow priority connections (#2604 ) In previous network updates we have made our libp2p connections more lean by limiting the maximum number of connections a lighthouse node will accept before libp2p rejects new connections. However, we still maintain the logic that at maximum connections, we try to dial extra peers if they are needed by a validator client to publish messages on a specific subnet. The dials typically result in failures due the libp2p connection limits. This PR adds an extra factor, `PRIORITY_PEER_EXCESS` which sets aside a new allocation of peers we are able to dial in case we need these peers for the VC client. This allocation sits along side the excess peer (which allows extra incoming peers on top of our target peer limit). The drawback here, is that libp2p now allows extra peers to connect to us (beyond the standard peer limit) which the peer manager should subsequently reject.	2021-09-21 07:45:13 +00:00
Age Manning	a73dcb7b6d	Improved handling of IP Banning (#2530 ) This PR in general improves the handling around peer banning. Specifically there were issues when multiple peers under a single IP connected to us after we banned the IP for poor behaviour. This PR should now handle these peers gracefully as well as make some improvements around how we previously disconnected and banned peers. The logic now goes as follows: - Once a peer gets banned, its gets registered with its known IP addresses - Once enough banned peers exist under a single IP that IP is banned - We retain connections with existing peers under this IP - Any new connections under this IP are rejected	2021-09-17 04:02:31 +00:00
Pawan Dhananjay	64ad2af100	Subscribe to altair gossip topics 2 slots before fork (#2532 ) ## Issue Addressed N/A ## Proposed Changes Add a fork_digest to `ForkContext` only if it is set in the config. Reject gossip messages on post fork topics before the fork happens. Edit: Instead of rejecting gossip messages on post fork topics, we now subscribe to post fork topics 2 slots before the fork. Co-authored-by: Age Manning <Age@AgeManning.com>	2021-09-17 01:11:16 +00:00
Age Manning	56e0615df8	Experimental discovery (#2577 ) # Description A few changes have been made to discovery. In particular a custom re-write of an LRU cache which previously was read/write O(N) for all our sessions ~5k, to a more reasonable hashmap-style O(1). Further there has been reported issues in the current discv5, so added error handling to help identify the issue has been added.	2021-09-16 04:45:05 +00:00
Age Manning	95b17137a8	Reduce network debug noise (#2593 ) The identify network debug logs can get quite noisy and are unnecessary to print on every request/response. This PR reduces debug noise by only printing messages for identify messages that offer some new information.	2021-09-14 08:28:35 +00:00
Wink Saville	4755d4b236	Update sloggers to v2.0.2 (#2588 ) fixes #2584	2021-09-14 06:48:26 +00:00
Paul Hauner	f9bba92db3	v1.5.2 (#2595 ) ## Issue Addressed NA ## Proposed Changes Version bump ## Additional Info Please do not `bors` without my approval, I am still testing.	2021-09-13 23:01:19 +00:00
Paul Hauner	ddbd4e6965	v1.5.2-rc.0 (#2565 ) ## Issue Addressed NA ## Proposed Changes - Bump version - Tidy some comments mangled by the version change regex. ## Additional Info NA	2021-09-03 23:28:21 +00:00
Michael Sproul	9c785a9b33	Optimize `process_attestation` with active balance cache (#2560 ) ## Proposed Changes Cache the total active balance for the current epoch in the `BeaconState`. Computing this value takes around 1ms, and this was negatively impacting block processing times on Prater, particularly when reconstructing states. With a large number of attestations in each block, I saw the `process_attestations` function taking 150ms, which means that reconstructing hot states can take up to 4.65s (31 * 150ms), and reconstructing freezer states can take up to 307s (2047 * 150ms). I opted to add the cache to the beacon state rather than computing the total active balance at the start of state processing and threading it through. Although this would be simpler in a way, it would waste time, particularly during block replay, as the total active balance doesn't change for the duration of an epoch. So we save ~32ms for hot states, and up to 8.1s for freezer states (using `--slots-per-restore-point 8192`).	2021-09-03 07:50:43 +00:00
realbigsean	50321c6671	Updates to make crates publishable (#2472 ) ## Issue Addressed Related to: #2259 Made an attempt at all the necessary updates here to publish the crates to crates.io. I incremented the minor versions on all the crates that have been previously published. We still might run into some issues as we try to publish because I'm not able to test this out but I think it's a good starting point. ## Proposed Changes - Add description and license to `ssz_types` and `serde_util` - rename `serde_util` to `eth2_serde_util` - increment minor versions - remove path dependencies - remove patch dependencies ## Additional Info Crates published: - [x] `tree_hash` -- need to publish `tree_hash_derive` and `eth2_hashing` first - [x] `eth2_ssz_types` -- need to publish `eth2_serde_util` first - [x] `tree_hash_derive` - [x] `eth2_ssz` - [x] `eth2_ssz_derive` - [x] `eth2_serde_util` - [x] `eth2_hashing` Co-authored-by: realbigsean <seananderson33@gmail.com>	2021-09-03 01:10:25 +00:00
Pawan Dhananjay	5a3bcd2904	Validator monitor support for sync committees (#2476 ) ## Issue Addressed N/A ## Proposed Changes Add functionality in the validator monitor to provide sync committee related metrics for monitored validators. Co-authored-by: Michael Sproul <michael@sigmaprime.io>	2021-08-31 23:31:36 +00:00
Paul Hauner	44fa54004c	Persist to DB after setting canonical head (#2547 ) ## Issue Addressed NA ## Proposed Changes Missed head votes on attestations is a well-known issue. The primary cause is a block getting set as the head after the attestation deadline. This PR aims to shorten the overall time between "block received" and "block set as head" by: 1. Persisting the head and fork choice after setting the canonical head - Informal measurements show this takes ~200ms 1. Pruning the op pool after setting the canonical head. 1. No longer persisting the op pool to disk during `BeaconChain::fork_choice` - Informal measurements show this can take up to 1.2s. I also add some metrics to help measure the effect of these changes. Persistence changes like this run the risk of breaking assumptions downstream. However, I have considered these risks and I think we're fine here. I will describe my reasoning for each change. ## Reasoning ### Change 1: Persisting the head and fork choice after setting the canonical head For (1), although the function is called `persist_head_and_fork_choice`, it only persists: - Fork choice - Head tracker - Genesis block root Since `BeaconChain::fork_choice_internal` does not modify these values between the original time we were persisting it and the current time, I assert that the change I've made is non-substantial in terms of what ends up on-disk. There's the possibility that some other thread has modified fork choice in the extra time we've given it, but that's totally fine. Since the only time we read those values from disk is during startup, I assert that this has no impact during runtime. ### Change 2: Pruning the op pool after setting the canonical head Similar to the argument above, we don't modify the op pool during `BeaconChain::fork_choice_internal` so it shouldn't matter when we prune. This change should be non-substantial. ### Change 3: No longer persisting the op pool to disk during `BeaconChain::fork_choice` This change is substantial. With the proposed changes, we'll only be persisting the op pool to disk when we shut down cleanly (i.e., the `BeaconChain` gets dropped). This means we'll save disk IO and time during usual operation, but a `kill -9` or similar "crash" will probably result in an out-of-date op pool when we reboot. An out-of-date op pool can only have an impact when producing blocks or aggregate attestations/sync committees. I think it's pretty reasonable that a crash might result in an out-of-date op pool, since: - Crashes are fairly rare. Practically the only time I see LH suffer a full crash is when the OOM killer shows up, and that's a very serious event. - It's generally quite rare to produce a block/aggregate immediately after a reboot. Just a few slots of runtime is probably enough to have a decent-enough op pool again. ## Additional Info Credits to @macladson for the timings referenced here.	2021-08-31 04:48:21 +00:00
Pawan Dhananjay	b4dd98b3c6	Shutdown after sync (#2519 ) ## Issue Addressed Resolves #2033 ## Proposed Changes Adds a flag to enable shutting down beacon node right after sync is completed. ## Additional Info Will need modification after weak subjectivity sync is enabled to change definition of a fully synced node.	2021-08-30 13:46:13 +00:00
Michael Sproul	10945e0619	Revert bad blocks on missed fork (#2529 ) ## Issue Addressed Closes #2526 ## Proposed Changes If the head block fails to decode on start up, do two things: 1. Revert all blocks between the head and the most recent hard fork (to `fork_slot - 1`). 2. Reset fork choice so that it contains the new head, and all blocks back to the new head's finalized checkpoint. ## Additional Info I tweaked some of the beacon chain test harness stuff in order to make it generic enough to test with a non-zero slot clock on start-up. In the process I consolidated all the various `new_` methods into a single generic one which will hopefully serve all future uses 🤞	2021-08-30 06:41:31 +00:00
Mason Stallmo	bc14d1d73d	Add more unix signal handlers (#2486 ) ## Issue Addressed Resolves #2114 Swapped out the ctrlc crate for tokio signals to hook register handlers for SIGPIPE and SIGHUP along with SIGTERM and SIGINT. ## Proposed Changes - Swap out the ctrlc crate for tokio signals for unix signal handing - Register signals for SIGPIPE and SHIGUP that trigger the same shutdown procedure as SIGTERM and SIGINT ## Additional Info I tested these changes against the examples in the original issue and noticed some interesting behavior on my machine. When running `lighthouse bn --network pyrmont \|& tee -a pyrmont_bn.log` or `lighthouse bn --network pyrmont 2>&1 \| tee -a pyrmont_bn.log` none of the above signals are sent to the lighthouse program in a way I was able to observe. The only time it seems that the signal gets sent to the lighthouse program is if there is no redirection of stderr to stdout. I'm not as familiar with the details of how unix signals work in linux with a redirect like that so I'm not sure if this is a bug in the program or expected behavior. Signals are correctly received without the redirection and if the above signals are sent directly to the program with something like `kill`.	2021-08-30 05:19:34 +00:00
Pawan Dhananjay	99737c551a	Improve eth1 fallback logging (#2490 ) ## Issue Addressed Resolves #2487 ## Proposed Changes Logs a message once in every invocation of `Eth1Service::update` method if the primary endpoint is unavailable for some reason. e.g. ```log Aug 03 00:09:53.517 WARN Error connecting to eth1 node endpoint action: trying fallbacks, endpoint: http://localhost:8545/, service: eth1_rpc Aug 03 00:09:56.959 INFO Fetched data from fallback fallback_number: 1, service: eth1_rpc ``` The main aim of this PR is to have an accompanying message to the "action: trying fallbacks" error message that is returned when checking the endpoint for liveness. This is mainly to indicate to the user that the fallback was live and reachable. ## Additional info This PR is not meant to be a catch all for all cases where the primary endpoint failed. For instance, this won't log anything if the primary node was working fine during endpoint liveness checking and failed during deposit/block fetching. This is done intentionally to reduce number of logs while initial deposit/block sync and to avoid more complicated logic.	2021-08-30 00:51:26 +00:00
Paul Hauner	b0ac3464ca	v1.5.1 (#2544 ) ## Issue Addressed NA ## Proposed Changes - Bump version ## Additional Info NA	2021-08-27 01:58:19 +00:00
Paul Hauner	4405425726	Expand gossip duplicate cache time (#2542 ) ## Issue Addressed NA ## Proposed Changes This PR expands the time that entries exist in the gossip-sub duplicate cache. Recent investigations found that this cache is one slot (12s) shorter than the period for which an attestation is permitted to propagate on the gossip network. Before #2540, this was causing peers to be unnecessarily down-scored for sending old attestations. Although that issue has been fixed, the duplicate cache time is increased here to avoid such messages from getting any further up the networking stack then required. ## Additional Info NA	2021-08-26 23:25:50 +00:00
Paul Hauner	3fdad38eba	Remove penality for duplicate attestation from same validator (#2540 ) ## Issue Addressed NA ## Proposed Changes A Discord user presented logs which indicated a drop in their peer count caused by a variety of peers sending attestations where we'd already seen an attestation for that validator. It's presently unclear how this case came about, but during our investigation I noticed that we are down-voting peers for sending such attestations. There are three scenarios where we may receive duplicate unagg. attestations from the same validator: 1. The validator is committing a slashable offense. 2. The gossipsub message-deduping functionality is not working as expected. 3. We received the message via the HTTP prior to seeing it via gossip. Scenario (1) would be so costly for an attacker that I don't think we need to add DoS protection for it. Scenario (2) seems feasible. Our "seen message" caches in gossipsub might fill up/expire and let through these duplicates. There are also cases involving message ID mismatches with the other peers. In both these cases, I don't think we should be doing 1 attestation == -1 point down-voting. Scenario (3) is not necessarily a fault of the peer and we shouldn't down-score them for it. ## Additional Info NA	2021-08-26 08:00:50 +00:00
Age Manning	09545fe668	Increase maximum gossipsub subscriptions (#2531 ) Due to the altair fork, in principle we can now subscribe to up to 148 topics. This bypasses our original limit and we can end up rejecting subscriptions. This PR increases the limit to account for the fork.	2021-08-26 02:01:10 +00:00

1 2 3 4 5 ...

1667 Commits