Attestation committee based full PoS chains

vbuterin · June 15, 2018, 10:19am

Status: not a new idea but discussed before, but deserves its own post for reference and to assist with understanding. The last section is original.

Suppose that you have a PoS chain, where there is some validator set V, and every block has an associated random beacon state R. If block.parent.random_beacon = R, then block.random_beacon = F(R, block) for some deterministic F which outputs 32 bytes; there may be (in fact, definitely should be!)( constraints on the values put into the block that are factored into F.

The simplest way to use this to build a PoS chain is that for every block, you can use V and R to calculate an infinite sequence of potential signers for children of this block: skip0_signer(V, R), skip1_signer(V, R)…

441bdd6d

Any of these blocks is valid, but blocks can only be accepted by a client once that client’s clock time exceeds the block’s expected time, where expected time is calculated by GENESIS_TIME + k1 * height + k2 * total_skips_since_genesis; hence, blocks with earlier skip counts can always be accepted earlier. It may be reasonable to set k1 = k2, but not necessary.

Adding attestation committees

We can improve the stability of this design by requiring every block to also be attested by a committee: to build a block with parent B, one must first gather signatures from, say, at least 1/2 of a set of M validators, which is itself pseudorandomly sampled from V based on the value of R. This makes forking less likely, as it means that a relatively large number of validators need to collude to make a fork.

However, this design by itself loses liveness: if less than ~1/2 of nodes are online, then almost no block will be successfully attested to and the chain will halt. We can fix this problem by making the attestation committee have a variable size: for a block created by the k-skip signer, we require the attestation of the parent to contain at least M / (2 + * k) signatures. This means that the 0-skip block must contain at least a 1/2 committee, but blocks with skips can make do with smaller committees.

Notice that if portion p of nodes is online, then it will on average take ~1/(p-2) skips before the online nodes will be enough to form a sufficiently sized committee each round, and then after that point each individual signer has a probability of p of being online, so the chain will progress at a rate of 1 height per ~2.5p rounds. This is comparable to the ~1/p progress of a simple chain.

5d065882

Attack analysis

Suppose that parameters are set as above, and an attacker has less than 50% of the total stake. As M approaches infinity, this means that on both the canonical chain and a hypothetical attacking chain, in the best case, where an attacker has at least 33%, a 1-skip or height block will always succeed, but a 0-skip block will only succeed on the canonical chain. The average inter-block delay on the canonical chain, with p > 0.5 support, will be:

k1 + k2 * (1-p) + k2 * (1-p)² + k2 * (1-p)³ + .....

On the attack chain, it will be:

k1 + k2 + k2 * p + k2 * p² + .....

It is clear that the latter expression is greater, and so the attacker’s chain will grow more slowly than the canonical chain.

Note that the above assumes that which validator goes in which “slot” is set in stone. If it is modifiable, then more complex analysis is required; that said, the attestation requirement is clearly an improvement because it penalizes the rate of growth of chains that have less than 50% of validators supporting them, by a factor of ~1/4. Note also that a chain with p=0.500001 with average luck will on average create a block in k1 + k2 - ε seconds, and a chain with p=0.499999 with hypothetical “perfect luck” will on average create a block in k1 + k2 + ε seconds, so the disparity is large enough to overcome all path search attacks.

Adding short-range “something at stake” without micro-penalties

The next question is: how do we solve the nothing-at-stake problem in the short range? The simplest fix is to have penalties for creating or attesting to an offchain block: if you were to be rewarded R for creating or attesting to a block that becomes part of the canonical chain, you get penalized R for creating or attesting to a block that does not become part of the canonical chain. This replicates the incentives of PoW, where mining an offchain block is implicitly penalized because the miner needs to consume CPU resources to create the block but is not rewarded for it.

But with large attester committees this creates a lot of overhead processing frequent penalties, and we could do better by removing the need for small penalties entirely, instead relying on slashing conditions to prevent validators from attesting to absolutely everything under the sun. Suppose that the data used to select the attester set for a block comes not from that block’s beacon value, but from that of its parent. Then, the various children to the same parent block will share the same attester set. We can then add a slashing condition heavily penalizing validators for attesting to two children of the same block, and thus attesting to one block has the implied cost that one can no longer also attest to its siblings, and so this creates a cost to attesting a potential challenger to the head of a chain. However, this does not extend to cousins and more distant relations.

We could extend this by making attester sets pre-selected far earlier. If hypothetically the validator set never changed and attester sets at each height were selected at genesis, then attesting to any one block would preclude the attester from attesting any other block at the same height, and so it would be risky to attest to any block that is not the head. However, pre-selecting attesters too early itself sacrifices flexibility and has its risks. One option is to re-select attester sets every time a block gets finalized; there is no value in signing messages on non-finalized chains as it is not possible for them to finalize in any case. Another hybrid route is to introduce some randomness, but only, for example, swap 1-10% of validators in the attester set every block based on that block’s new entropy. This creates the effect that attesting to a block definitely precludes one from attesting to its siblings, probably precludes one from attesting to any specific close cousin, and makes it marginally less likely that one will be able to attest to a distant cousin (but there is also very very little value in attesting to blocks far away from the head, so large disincentives are not really needed to begin with).

[past] -> [B: expected time 1504124501]
[B: expected time 1504124501] -> […]
[B: expected time 1504124501] -> [2 skip block: expected time 1504124519]
[B: expected time 1504124501] -> [1 skip block: expected time 1504124513]
[B: expected time 1504124501] -> [0 skip block: expected time 1504124507]
[img]
[B]
[B] -> […]
[B] -> [2 skip block: T >= 1504124519; attestations >= 25]
[B] -> [1 skip block: T >= 1504124513; attestations >= 34]
[B] -> [0 skip block: T >= 1504124507; attestations >= 50]
[/yuml]

naterush · June 26, 2018, 8:16am

Adding attestation committees sounds like a very reasonable way of limiting forking. As a side note, they seem like a helpful piece of the puzzle in “allowing merge blocks between connected shards in any DAG of shards (that isn’t just a tree + hierarchy).” The issue with allowing merge blocks between connected shards in any ole DAG is that both shards have the ability to fork and orphan these blocks, causing the other shard to have to do the same (etc).

we require the attestation of the parent to contain at least M / (2 + * k) signatures

Should this be M / (2 + 4/3 * k) (what I got plugging into your examples below )? If so, is there any reasoning for this specific constant?

We can then add a slashing condition heavily penalizing validators for attesting to two children of the same block

Consider your above example, where M = 134 (gotten from multiplying 0 skip attestation committee size by 2), and where 67 - 1 = 66 validators attest to the 0 skip, and then 40 - 1 = 39 attest to the 1 skip, and then 29 - 1 = 28 attest to the 2 skip, and then 1 attests to the 3 skip, so that all 134 validators attesting to some block. With this slashing condition, it seems like none of the validators will be able to attest to any other block (a child and at this height).

This makes it seem like the committee-size-for-skip-block function should be \lfloor M / (2 + 4/3 * k) \rfloor, so the size of the committee required eventually goes to 0. Otherwise, we run the risk of getting stuck permanently.

Furthermore, we can calculate how many blocks must be made before the committee size goes to 0. In the example above, this is when 134 / (2 + 4/3 * k) < 1, or k > 99. Essentially, if validators get stuck in the manner described above, we would have to wait until the 100th skip block before we could move forward. Although the specific example I gave was very contrived, one can come up with probably-more-common cases where we still have to wait a large number of skip block before any progress can be made.

One possible improvement would be to allow a validator to switch from a i skip block to a j skip block if a) j < i, and b) the committee currently on j > committee currently on i, but now we get into more-complicated and less-aggregatable things like justifications. Maybe there’s a simpler scheme that can mitigate the above examples.

One option is to re-select attester sets every time a block gets finalized

This seems like it might be dangerous, as the attester set controls the fork choice and can choose to orphan anything that leads to finality, and thus stop themselves from getting rotated out.

only, for example, swap 1-10% of validators in the attester set every block based on that block’s new entropy

This, or choosing a new attester set each block, seems like the most reasonable approach to me

vbuterin · June 27, 2018, 1:41pm

Fixed now

This seems like it might be dangerous, as the attester set controls the fork choice and can choose to orphan anything that leads to finality, and thus stop themselves from getting rotated out.

So my thinking on this has evolved now. My current favored approach is that at the start of each epoch, we randomly select a shuffling of the entire validator set, and then split that shuffling into (epoch length) pieces, where each piece corresponds to one of the block heights in the next epoch.

Someone who literally controls the full attester set for the same height can certainly prevent finality, but that’s statistically infeasible to achieve; and if you only control 95% of the attester set then you can only delay finality by ~20 blocks.

An alternative formulation is where the epoch length is defined in slots, not height; in this case, the formulation could be that each slot defines N attesters, each block needs at least N/2 attesters, but the attesters could come from the combination of all slots since the slot that created the last block.

ldct · June 29, 2018, 6:34am

question about the basic system (before the section “adding attestation committees”): who produces the 0 skip block, the 1 skip block, etc?