A "human readable" subjectivity format proposal

vladzamfir · August 30, 2017, 4:36pm

Hi Everyone,

This is my first post, hopefully the first of many.

I think Casper (or other protocols with weak subjectivity) would benefit from formats for “the weak subjectivity information”, so that people can easily and safely share the information required to synchronize with the consensus:

a list of public crypto credentials and weights associated with the validators,
a block finalized by that validator set.

The above is required to make a fork-choice and to tell when new blocks are finalized. But we naturally would like to replace this with the following (which represents the “status quo proposal”):

a single block hash.

Armed with a single block hash, you could connect to the network and ask peers for (and hopefully receive):

a block with that hash
merkle proofs for the block’s current validator set
a block finalized by the block’s current validator set*

And then after receiving and authenticating the block’s hash, the merkle proofs, and finality on the other block, you can safely use the fork-choice rule.

This (or something like this) can be made to work, but we did make a couple of background assumptions:

you are able to peer
the weak subjectivity information is not expired

Clients are already required to have at least a hash in order to authenticate the consensus (fork choice and finality) for the first time, so I’d like to require some additional information:

[IP_address, block hash, expiry date**]

The IP address is trusted to provide peers who can supply information that can be authenticated by checking that it hashes to the block hash. The block hash is trusted information (trusted to have a currently-bonded validator set), and the expiry date is just there to prevent the user from using stale authentication information.
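
A minimal sketch of how a client might hold and check this triple, in Python. The names `WeakSubjectivityInfo`, `is_expired`, and `matches_block` are hypothetical, and SHA-256 over raw bytes is only a stand-in for the real block hash function, which this post does not specify.

```python
import hashlib
import ipaddress
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class WeakSubjectivityInfo:
    """Hypothetical container for [IP_address, block hash, expiry date]."""
    ip_address: str    # where to ask for the first peers
    block_hash: bytes  # trusted hash of a block with a bonded validator set
    expiry: date       # human readable date after which the info is stale

    def __post_init__(self):
        # Raises ValueError if the address is not valid IPv4 or IPv6.
        ipaddress.ip_address(self.ip_address)

    def is_expired(self, today: date) -> bool:
        # The info is usable up to and including the expiry date.
        return today > self.expiry

    def matches_block(self, block_bytes: bytes) -> bool:
        # Assumption: SHA-256 stands in for the chain's real block hash.
        return hashlib.sha256(block_bytes).digest() == self.block_hash
```

A client would refuse to use the triple if `is_expired` returns true, and would discard any block offered by peers for which `matches_block` returns false.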

The main advantage of this proposal is that it removes the need for bootstrap seed nodes which provide clients with peers.

The challenge is to make it as easy and safe as possible for people to share this information. One idea is to encode the information in a QR Code, which is absolutely useful. However, for a human readable format (something that could be easily written down by hand and read aloud), the following might represent an improvement:

[domain name in DNS, first 14 characters of Base58Encoded(block hash), expiry date]

I chose “Base58” encoding (used in Bitcoin) because it’s a bit safer than base 64: it omits easily confused characters (0, O, I, l). Base 64 would provide more compression. Since a mistyped hash will almost certainly not match any valid block, I think of base 64 as introducing a liveness bug for users who don’t enter the hash correctly.
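
To make the human readable format concrete, here is a rough Python sketch that Base58-encodes a block hash, keeps the first 14 characters, and joins them with a domain name and an ISO date. The function names, the space separator, the example domain, and the use of SHA-256 to produce a sample hash are all assumptions for illustration, not part of the proposal.

```python
import hashlib
from datetime import date

# Bitcoin's Base58 alphabet: no 0, O, I, or l.
BASE58_ALPHABET = "123456789ABCDEFGHJKLMNPQRSTUVWXYZabcdefghijkmnopqrstuvwxyz"


def base58_encode(data: bytes) -> str:
    """Encode bytes as Base58, mapping each leading zero byte to '1'."""
    n = int.from_bytes(data, "big")
    out = ""
    while n > 0:
        n, rem = divmod(n, 58)
        out = BASE58_ALPHABET[rem] + out
    leading_zeros = len(data) - len(data.lstrip(b"\x00"))
    return "1" * leading_zeros + out


def format_ws_info(domain: str, block_hash: bytes, expiry: date,
                   prefix_len: int = 14) -> str:
    """Hypothetical layout: '<domain> <hash prefix> <YYYY-MM-DD>'."""
    prefix = base58_encode(block_hash)[:prefix_len]
    return f"{domain} {prefix} {expiry.isoformat()}"


# Example with a stand-in hash (SHA-256 of arbitrary bytes).
sample_hash = hashlib.sha256(b"example block").digest()
print(format_ws_info("example.org", sample_hash, date(2017, 12, 31)))
```

Fourteen Base58 characters carry about 82 bits (14 × log2 58 ≈ 82), so the prefix is short enough to read aloud while still making an accidental match with another block very unlikely.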

I went with DNS instead of ENS because using ENS requires already having peers (just as DNS requires having a DNS server, but presumably that’s easier to do). All in all, I think the choice between using DNS and using an IP address is about whether a usability/security trade-off is worth the gained readability.

I think there’s probably lots of room for improvement and other people’s thoughts!

Vlad

*It may be reasonable to do this with a new request type, or to look into the state trie for the last finalized block. But then it must be the case that the block with the hash you received has state that contains the hash of a block finalized by that validator set.

** should be a human readable date, rather than a block number or something like that

nate · August 31, 2017, 7:12am

It’s probably best practice to add a checksum to the block hash to further prevent mistakes, whatever encoding is used.

Also, excuse my ignorance, but what happens in the case of a community-splitting-hard-fork? Do users just have to make sure (or trust) the hash they are receiving is on the fork they prefer? It seems impossible to enforce, but could there ever be such a thing as a chain ID?

MicahZoltu · August 31, 2017, 8:04am

There was a discussion in EIPs at one point about making it so every fork resulted in a new identifier. The only way to keep the chain identifier would be to not change the consensus protocol in any way (meaning stagnate). I was a big fan of this because (among other reasons) it would allow for what you are asking, which is something that can be displayed to end-users and clients can validate (assuming the client acquired an old set of consensus rules via off-chain means).

vbuterin_old · August 31, 2017, 10:49am

How would this format get used? Would you ask your friends to send you the [IP_address, block_hash, expiry_date] triple, and then your client would make sure all the answers you get are compatible, and if so choose that chain?

If so, what’s the point of including the IP address?

vladzamfir · August 31, 2017, 5:32pm

Nate, good idea to have a checksum in the hash. It can even be just one character.
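
One possible way to do a one-character checksum, sketched in Python. The scheme here (SHA-256 of the prefix, reduced mod 58 and mapped into the Base58 alphabet) and the names `append_checksum` and `verify_checksum` are assumptions picked for illustration. A random typo in the prefix would slip through roughly 1 time in 58.

```python
import hashlib

BASE58_ALPHABET = "123456789ABCDEFGHJKLMNPQRSTUVWXYZabcdefghijkmnopqrstuvwxyz"


def checksum_char(prefix: str) -> str:
    """Derive one Base58 character from the prefix (hypothetical scheme)."""
    digest = hashlib.sha256(prefix.encode("ascii")).digest()
    return BASE58_ALPHABET[int.from_bytes(digest, "big") % 58]


def append_checksum(prefix: str) -> str:
    """Return the prefix with its checksum character appended."""
    return prefix + checksum_char(prefix)


def verify_checksum(text: str) -> bool:
    """Check alphabet membership, then recompute the last character."""
    if len(text) < 2 or any(c not in BASE58_ALPHABET for c in text):
        return False
    return checksum_char(text[:-1]) == text[-1]
```

With this, a client can reject most mistyped strings locally, before asking any peer for a block.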

Micah, I’m also a big fan of the mechanism you describe, which was discussed (iirc) as a way to future-proof hard forks against replay attacks.

Vitalik, the IP address is necessary in order to find your first peers. The proposal here is to replace default seed nodes with weak subjectivity information, since it doesn’t substantially change the UX of weak subjectivity. Hope this makes sense!

vbuterin_old · September 3, 2017, 6:09am

Ah, I see! You’re basically making an argument that the default peers are in some sense already a kind of info that can only be transmitted by weak subjectivity, so we can just use the same mechanism that tells us the block hash to also give us peer lists, and that way we mitigate the centralization inherent in the default peer list. Nice!