Recommended: Link to working doc with images
Asymmetric Risk/Reward for Good vs Bad Actors
The key to an ideal implementation will be a successful asymmetrically prominent reward for good actors and asymmetrically prominent penalty for bad actors. Taking arbitrary numbers for example, a mechanism that rewards good actors by up to 40% with maximum downside of 10-20% and penalizes bad actors by up to 75-100% with the same reward profile would be a successful asymmetric risk/reward design for good vs bad actors.
- Vote! Voting should always be better off than not voting.
- Cooperate! The group should be rewarded for cooperating (finalizing with 100% p_v is a less fragile equilibrium than finalizing with 67% p_v, so they should be rewarded as such), and vice versa.
- Finalize! There’s a discontinuous jump in rewards and penalties if the proportion of the validator set p_v voting on the correct checkpoint is lower than the supermajority.
Shapes of Y & N with respect to each p_v, ESF, TD
Here’s a proof of concept shapes of Y & N with respect to proportion of validators who voted, time since finality and total deposits. I’m sure a lot of this will change as we iterate, but here’s a first cut.
- If you vote, you’re always better off than not voting.
- If you vote, there’s no penalty other than the slashing conditions.
- The more time passes since finality, the more your rewards deteriorate (until it’s 0) and the more the penalties become draconian.
- There’s a fixed constant penalty (or removal of rewards) if p_v is below supermajority.
- Reward rate increases if a higher proportion of the validators vote (or reward rate is decreased if a lower proportion of the validators vote. Same effect with non-voting.
- Yield with respect to total deposits reflect fixed income assets (i.e. asymptotically approach market required returns as determined by validator set, more in +Casper FFG: Returns, Deposits & Market Cap). There’s some reasonable min TD to prevent excessively high yield.
Bootstrapping vs Mature Networks
The sections above suggested shapes of the rewards/penalties with respect to relevant variables. Let’s add another variable and another layer of complexity: maturity of the network (the validator set).
If the ultimate goal for total deposits is a $1B and the network is currently at $10M in deposits, we have to adjust the magnitude of the changes (i.e. slope, coefficients).
For example, the observed rewards level maybe trend down as the network becomes more stable and the required returns trend down. The observed penalties may also come down as security increases with higher TD as % of MCap.
Need more eyes on this please, feedback welcome!