Mining attacks on PoRA (Proof of Random Access)

Abstract

This article analyzes the security of the Proof of Random Access (PoRA) consensus mechanism against potential mining attacks. The focus is on two main attack vectors: the shrink attack, where an attacker uses cheaper or smaller storage devices with the same throughput to gain an economic advantage, and the Moore attack, where an attacker leverages advancements in technology to increase throughput while maintaining the same cost. The article examines these attacks under both unlimited and limited mining throughput scenarios and provides mathematical analysis to determine the conditions under which the attacks could be economically viable for miners.

Security model

In our security model, we consider an attacker who aims to optimize their mining procedure to gain an economic advantage over honest miners. The attacker can employ various strategies to achieve this goal:

  1. Equipment replacement: The attacker may replace their storage equipment with more efficient or cost-effective alternatives. This strategy allows the attacker to maintain or increase their mining performance while reducing costs.

  2. Partial fraction mining: The attacker can utilize a portion of their storage space and throughput for one client while using the remaining resources for another client. This allows the attacker to optimize their resource allocation and potentially gain an advantage by serving multiple clients simultaneously.

We will analyze two specific attack vectors:

  1. Shrink attack: In this attack, the attacker replaces their storage device with a cheaper one that offers the same performance. For example, an attacker might replace a more expensive 1TiB NVMe drive with a cheaper 1TB drive that has the same throughput, reducing storage costs while maintaining mining performance. The key idea is that the consensus mechanism may not distinguish between storage devices of different sizes as long as they offer the same throughput.

  2. Moore attack: This attack involves the attacker replacing their storage equipment with a newer, more advanced version that offers better performance at the same cost. For example, an attacker might replace a gen 4 storage device with a gen 5 device that has twice the throughput at the same cost. This allows the attacker to increase their mining performance without incurring additional expenses.

It is important to note that the Proof of Random Access (PoRA) consensus mechanism is well-scalable, meaning that there is no single machine limitation. This scalability allows for the possibility of using RAM rigs on tiny devices, such as single-board computers or even smartphones, to participate in the mining process. The absence of a single machine limitation opens up new opportunities for attackers to optimize their mining setups and potentially gain an advantage over honest miners.

Furthermore, the ability to perform partial fraction mining, where an attacker can allocate a portion of their storage space and throughput to one client while using the remaining resources for another client, adds another layer of complexity to the security model. This flexibility in resource allocation allows attackers to optimize their mining strategies and potentially gain an edge by serving multiple clients simultaneously.

Throughout the following sections, we will examine these attack vectors in detail, considering both unlimited and limited mining throughput scenarios. Our analysis will focus on determining the conditions under which these attacks could be economically viable for miners, and we will provide mathematical derivations to support our findings.

Shrink attack

In the following sections, we consider three different attacks. The first attack assumes that the mining throughput is not limited and shows how, in this case, the adversary can gain an advantage over honest miners by using a different hardware configuration. In the following two attacks, we assume that the mining throughput has been limited to mitigate the first attack, but we show that new issues arise: the adversary can be economically incentivized to drop part of the stored file, and can perform the Moore attack using more efficient hardware than that of honest miners.

In our cost analyses we make a few reasonable assumptions about the relationships between the costs and make conservative estimates of the adversary's advantage.

Unlimited throughput: achieving an advantage over honest miners

In this scenario, we consider the case where an attacker reduces the size of their memory module to gain an economic advantage. We make a pessimistic assumption that if the memory size is reduced by half, the maintenance cost (energy consumption) and throughput will remain the same, while the cost will be reduced by half. In reality, the energy consumption would likely be lower, but this assumption can only make our analysis more conservative.

To compare the cost efficiency of the attacker and the reference miner, we normalize the values by the cost and present them in the following table:

            Cost   Maintenance   Throughput
reference   1      A             1
attacker    1      \chi A        \chi
  • The reference miner’s cost of purchasing one unit of hardware is set to 1 as a baseline; this is without loss of generality as we normalize other values with respect to this, eliminating a free variable.

  • A \sim 1 represents the maintenance cost (energy consumption) per time unit for the reference miner, which is assumed to be close to 1 for simplicity.

  • The reference miner’s throughput is also normalized to 1.

  • The attacker’s cost is set to 1, assuming that one unit of attacker’s hardware has the same cost as that of a reference miner. This is a conservative estimate, since one unit of attacker’s hardware is assumed to be less efficient than that of a reference miner.

  • The attacker’s maintenance cost is \chi A, where \chi > 1 represents the throughput advantage of the attacker per unit cost. This is because the attacker’s devices are smaller and cheaper, so one unit of cost buys \chi devices, each with the same energy consumption as before.

  • The attacker’s throughput is \chi, reflecting their advantage in terms of throughput per unit cost.

To compare the total cost efficiency, we calculate the throughput per unit of total cost (cost + maintenance) for both the reference miner and the attacker:

Reference miner: \frac{1}{1+A}

Attacker: \frac{\chi}{1+\chi A} = \frac{1}{1/\chi+A}

Since \chi > 1 when the attacker reduces the memory size, we can conclude that:

\frac{1}{1+A} < \frac{\chi}{1+\chi A} = \frac{1}{1/\chi+A}

This inequality demonstrates that the attacker has a better total cost efficiency compared to the reference miner.
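As a sanity check, the inequality can be evaluated numerically. The sketch below is illustrative only; A = 1 and \chi = 2 (a module shrunk to half its size) are assumed example values, not fixed by the analysis.

```python
# A quick numeric check of the inequality above. The values A = 1 and chi = 2
# are illustrative assumptions.
A = 1.0      # maintenance (energy) cost per time unit, normalized to hardware cost
chi = 2.0    # shrink factor: throughput gained per unit of cost

reference_efficiency = 1 / (1 + A)         # throughput per unit of total cost
attacker_efficiency = chi / (1 + chi * A)  # equals 1 / (1/chi + A)

print(f"reference: {reference_efficiency:.3f}")  # 0.500
print(f"attacker:  {attacker_efficiency:.3f}")   # 0.667 > 0.500 for any chi > 1
```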

Therefore, the original PoRA is not resistant to the shrink attack under unlimited mining throughput. The only way to protect against this vulnerability is to limit the rewardable mining throughput (and thereby the mining rewards), which removes the attacker's incentive to exploit this weakness.

Limited throughput: not storing part of the file

In this scenario, we consider the case where the mining throughput is limited to an optimal value of 1, and we analyze the cost efficiency for an attacker who uses only a fraction p of their memory.

Let’s define the following variables:

  • n: the number of random accesses

  • q = 1 - p \ll 1, assuming qn \lesssim 1

  • n_e: the effective average number of accesses, given by n_e = (1-p^n)/(1-p) \approx (1 - \exp(-qn)) / q

  • p_s: the success probability, given by p_s = p^n \approx \exp(-qn)

  • \tau: the slowdown of sampling, given by \tau = n_e/(n \cdot p_s) = (\exp(qn)-1)/(qn)

  • B: the sampling cost, assumed to be much smaller than 1 (B \ll 1) to ensure that the main cost of the algorithm is not CPU PoW
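The following helper functions are a small illustrative sketch of these definitions (the example values p = 1 - 10^{-4} and n = 10^4 are assumptions of the example, not fixed by the protocol):

```python
import math

def effective_accesses(p: float, n: int) -> float:
    """n_e: expected number of accesses before a missing sample aborts a candidate."""
    return float(n) if p == 1.0 else (1 - p**n) / (1 - p)

def success_probability(p: float, n: int) -> float:
    """p_s: probability that all n sampled locations are stored."""
    return p**n

def sampling_slowdown(p: float, n: int) -> float:
    """tau = n_e / (n * p_s), approximately (exp(qn) - 1) / (qn) for q = 1 - p."""
    return effective_accesses(p, n) / (n * success_probability(p, n))

# Example: drop q = 1e-4 of the data with n = 1e4 accesses (qn = 1).
p, n = 1 - 1e-4, 10_000
print(sampling_slowdown(p, n))                      # ~1.72, close to e - 1
print((math.exp((1 - p) * n) - 1) / ((1 - p) * n))  # the approximation, also ~1.72
```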

We can compare the reference miner and the attacker using the following table:

                        Cost   Maintenance   Sampling        Throughput
reference               1      A             B               1
attacker (full)         1      \chi A        \chi B          \chi
attacker (fraction p)   p      \chi p A      \chi p B \tau   \chi p
  • The reference miner’s costs and throughput are normalized to 1.

  • The attacker’s maintenance, sampling cost, and throughput are scaled by \chi when using the full (shrunk) memory, while the purchase cost stays 1.

  • When the attacker uses only a fraction p of their memory, their cost, maintenance, and throughput are scaled by p, while the sampling cost is scaled by p\tau to account for the slowdown in sampling.

For qn \lesssim 1, we have \tau \sim 1, which means B\tau \ll 1.

To consume all throughput, the attacker must satisfy the equation: \chi p = 1.

For efficient mining, the following condition must be met:

p\cdot (1 + \chi A) + B \tau < 1 + A + B

Simplifying this condition using \chi p = 1, we get:

p + (\tau - 1) B < 1

which is equivalent to

q > (\tau - 1) B \approx qnB/2

using \tau - 1 \approx qn/2 for qn \lesssim 1. Dividing both sides by q, the attack is therefore profitable whenever

n B \lesssim 2
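A small numeric sketch of this condition, using illustrative parameters (A = 1, B = 10^{-5}, n = 10^4, q = 10^{-4}; the specific values are assumptions of the example):

```python
# Sketch of the profitability check for partially dropped data under the
# throughput limit chi * p = 1. All parameter values are illustrative.
A, B = 1.0, 1e-5     # maintenance and sampling cost relative to hardware cost
n = 10_000           # number of random accesses
q = 1e-4             # dropped fraction of the file
p = 1 - q
chi = 1 / p          # the attacker fills the throughput limit: chi * p = 1

p_s = p**n                        # success probability
n_e = (1 - p**n) / (1 - p)        # effective accesses per candidate
tau = n_e / (n * p_s)             # sampling slowdown

attacker_total = p * (1 + chi * A) + B * tau   # total cost at unit throughput
reference_total = 1 + A + B
print(attacker_total < reference_total)        # True: dropping data pays off
print(n * B < 2)                               # the approximate condition n*B < 2
```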

To estimate B, let’s consider the example of a Samsung 970 SSD with a throughput of 2GB/s, TDP of 6W, and a value size of 1MB. The hash efficiency for CPU is 30MH/J, and for ASIC, it is 3GH/J.

The additional TDP for sampling will be:

  • For CPU: 2e9/1e6/30e6 = 6\text{e-5}W

  • For ASIC: 2e9/1e6/3e9 = 6\text{e-7}W

By dividing these values by the TDP, we can roughly estimate B to be in the range of 1\text{e-}5 to 1\text{e-}7.

This means that n should be greater than 1\text{e}5 to 1\text{e}7 to make the shrink attack inefficient, which may not be practical in real-world scenarios.
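This estimate can be reproduced with the short sketch below (it assumes one hash per sampled value, which is a simplification of this sketch rather than a statement from the text):

```python
# Rough estimate of B from the Samsung 970 example.
throughput = 2e9    # bytes per second
value_size = 1e6    # bytes per sampled value
tdp = 6.0           # drive TDP in watts

for name, hashes_per_joule in [("CPU", 30e6), ("ASIC", 3e9)]:
    sampling_power = throughput / value_size / hashes_per_joule  # extra watts for hashing
    B = sampling_power / tdp                                     # sampling cost relative to storage
    print(f"{name}: ~{sampling_power:.1e} W extra, B ~ {B:.1e}, need n > ~{2 / B:.1e}")
```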

Since nB \ll 2 in this regime, any q with qn \lesssim 1 is profitable for the attacker. For example, with a storage size of 1TB, a value size of 1MB, n=1\text{e}4, and B=1\text{e-}5, taking q=1\text{e-}4 means that 100 MB of data could be forgotten while still providing an economic benefit for the miner.

Limited throughput: Moore attack

When considering the Moore attack, it’s important to note that miners will align their throughput to the limit imposed by the system. Let’s analyze the cost efficiency of the reference miner and the attacker in this scenario.

                        Cost + Maintenance   Sampling          Throughput
reference               1                    B                 1
attacker (upgraded)     1                    B                 \chi
attacker (fraction p)   p                    \theta B p \tau   \theta \chi p
  • The reference miner’s cost, maintenance, and throughput are normalized to 1, with a sampling cost of B.

  • After upgrading their hardware, the attacker’s cost and maintenance remain the same, but their throughput increases by a factor of \chi.

  • When the attacker uses only a fraction p of their upgraded hardware, their sampling cost is scaled by \theta p\tau, where \theta is a throughput utilization parameter, and their throughput is scaled by \theta \chi p.

\theta represents the throughput utilization parameter, which indicates the fraction of the attacker’s upgraded throughput that they actually use. For example, if \theta = 0.8, the attacker is utilizing 80% of their upgraded throughput. This parameter allows us to model situations where the attacker may not be using their full upgraded capacity, either intentionally or due to technical limitations.

To consume all available throughput, the attacker must satisfy the equation: \theta \chi p = 1.

For efficient mining, the following condition must be met:

p \cdot (1 + \theta B \tau) < 1 + B

Expanding this condition using the approximations for \tau and p_s from the previous section, we get:

(1-q) (1 + \chi^{-1} B (1 + qn/2)) < 1 + B

Simplifying further:

\chi^{-1} nB/2 - 1 - B - \chi^{-1} qnB/2 < 0

\chi^{-1} pnB/2 < 1 + B

n B \lesssim 2 \chi

To find the optimal value of the free parameter q (with \theta then fixed by \theta \chi p = 1), we take the partial derivative of p \cdot (1 + \chi^{-1} B \tau) with respect to q and set it to zero:

\partial_q (p \cdot (1 + \chi^{-1} B \tau)) \approx -(1 + \chi^{-1} B (1 + qn/2)) + \chi^{-1} Bn/2 \cdot (1 + 2qn/3) = \chi^{-1} Bn/2 - (1 + \chi^{-1} B) + \chi^{-1} B q n (n / 3 - 1/2) = 0

Solving for q, we get:

q = \frac{-Bn/2 + \chi + B}{B n (n / 3 - 1/2)} > 0

This result suggests that n should be greater than 1\text{e}5 to 1\text{e}7 to make the Moore attack inefficient.

For example, consider a storage size of 1TB, value size of 1MB, n=1\text{e}4, B=1\text{e-}5, and \chi=2. Plugging these values into the equation for q, we get q \approx 0.006, which means that roughly 6GB of data could be forgotten while still providing economic benefits for the attacker.
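A direct evaluation of this closed form (a sketch using the example values above):

```python
# Evaluating the closed form for the optimal dropped fraction q derived above,
# with the example values B = 1e-5, n = 1e4, chi = 2.
B, n, chi = 1e-5, 10_000, 2.0
storage_bytes = 1e12          # 1 TB

q_opt = (chi + B - B * n / 2) / (B * n * (n / 3 - 0.5))
print(f"q = {q_opt:.4f}")                                         # ~0.0059
print(f"~{q_opt * storage_bytes / 1e9:.1f} GB can be dropped")    # ~5.9 GB
```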

RAM rig

In the original PoRA paper, the authors compare the performance of a Samsung 970 EVO NVMe SSD and 256GB DDR4-3200 RAM. Based on the calculations in the previous sections, we arrive at a counterintuitive conclusion: when there are no throughput limitations, only the throughput matters, not the size of the storage. To further illustrate this point, let’s compare the efficiency of a Crucial T705 1TB NVMe SSD and Crucial 8GB DDR5-4800 RAM.

        Cost (USD)   TDP (W)   Throughput (GB/s)
NVMe    188          15        13.7
DDR5    25           10        72

The table above compares the cost, thermal design power (TDP), and throughput of the two storage devices. The NVMe SSD has a higher cost and TDP but a lower throughput compared to the DDR5 RAM.

To calculate the cost efficiency of each device, we need to consider the maintenance cost and the amortization of the equipment over its lifetime. Let’s assume that the maintenance cost for 1W of power is about 4.4 USD per year and that the equipment is amortized over 4 years.

For the NVMe SSD, the cost per 1 GB/s of throughput per year is:

(188/4 + 15*4.4) / 13.7 = 8.25 USD

For the DDR5 RAM, the cost per 1 GB/s of throughput per year is:

(25/4 + 10*4.4) / 72 = 0.70 USD

The results show that the DDR5 RAM is significantly more cost-efficient than the NVMe SSD when considering the cost per 1 GB/s of throughput per year. This finding supports the idea that, in the absence of throughput limitations, using high-throughput RAM can be more economically viable for mining than using NVMe SSDs, despite the difference in storage capacity.
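The arithmetic above can be reproduced with this short sketch (the maintenance price and amortization period are the assumptions stated in the text):

```python
# Yearly cost per 1 GB/s of throughput for the two devices in the table above,
# using the assumed 4.4 USD per watt-year maintenance cost and 4-year amortization.
MAINTENANCE_USD_PER_WATT_YEAR = 4.4
AMORTIZATION_YEARS = 4

devices = {
    "Crucial T705 1TB NVMe": {"cost_usd": 188, "tdp_w": 15, "throughput_gbps": 13.7},
    "Crucial 8GB DDR5-4800": {"cost_usd": 25, "tdp_w": 10, "throughput_gbps": 72},
}

for name, d in devices.items():
    yearly_cost = d["cost_usd"] / AMORTIZATION_YEARS + d["tdp_w"] * MAINTENANCE_USD_PER_WATT_YEAR
    print(f"{name}: {yearly_cost / d['throughput_gbps']:.2f} USD per GB/s per year")
# NVMe: 8.25, DDR5: 0.70
```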

Conclusion

The analysis of the shrink and Moore attacks on the PoRA consensus mechanism highlights potential vulnerabilities in the system. The article demonstrates that without proper limitations on mining rewards and a sufficiently high number of random accesses, attackers could gain economic benefits by using cheaper, smaller storage devices or leveraging advancements in technology to increase throughput. To mitigate these risks, the PoRA mechanism should be designed with appropriate parameters, such as limiting mining rewards and ensuring a high number of random accesses. Additionally, the comparison between NVMe storage and RAM suggests that RAM-based mining rigs could pose a significant threat to the security of the system, as they are more cost-effective per unit of throughput.

Further research

We are planning to soon publish an article with green (no PoW inside) proofs of storage, based on statistics, economics, and zkSNARK cryptography, suitable for our decentralized storage research, available at:

Discussion

Should the objective of the efficiency analysis be

the cost of a successful random-sampling candidate of the reference > the cost of a successful random-sampling candidate of the attack?

For example, given that the cost of the reference is 1 and the attack cost is p < 1, the average number of random accesses to find a successful candidate is

n_a = \sum_{i=1}^{n} 1/p^i (see a simulator)

Therefore, we have (let us ignore the power cost, but it should not change the conclusion)

            Upfront Cost   Size   Lifetime Sampling Candidates   Cost Per Candidate
reference   1              1      1/n                            n
attack 1    x              1      \chi/n                         n x / \chi
attack 2    x p            p      \chi / n_a                     n_a x p / \chi

where \chi is the performance gain of the attack device.

Suppose the 1TB disk cost is 100 and the 16GB memory cost is 40, with n=16 and \chi=8; then we have

            Upfront Cost   Size       Lifetime Sampling Candidates   Cost Per Candidate
reference   100            1          1/16                           1600
attack 1    40             16/1024    8/8e28                         40e28
attack 2    640            256/1024   8/5.7e9                        4.5e11
attack 3    2560           1          8/16                           5120
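The figures above can be reproduced with a short sketch of n_a (an illustrative stand-in for the simulator mentioned earlier; the device labels are only descriptive):

```python
# Sketch reproducing the cost-per-candidate figures above (chi = 8, n = 16, and
# the example prices).
CHI, N = 8, 16

def n_a(p: float, n: int = N) -> float:
    """Expected random accesses per successful candidate: sum_{i=1..n} (1/p)^i."""
    return sum((1 / p) ** i for i in range(1, n + 1))

setups = {                          # name: (upfront cost, stored fraction of the 1TB file)
    "reference (1TB disk)": (100, 1.0),
    "attack 1 (1x 16GB RAM)": (40, 16 / 1024),
    "attack 2 (16x 16GB RAM)": (640, 256 / 1024),
    "attack 3 (64x 16GB RAM)": (2560, 1.0),
}

for name, (cost, p) in setups.items():
    accesses = 1.0 if name.startswith("reference") else float(CHI)  # normalized lifetime accesses
    candidates = accesses / n_a(p)
    print(f"{name}: cost per candidate = {cost / candidates:.3g}")
# reference ~1.6e3, attack 1 ~4e29, attack 2 ~4.6e11, attack 3 ~5.12e3
```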

For the limited throughput case, our design is not to limit the device throughput but to limit the rate of success of the candidates. One implementation of this constraint is to limit the nonce range of the sampling. An implementation can be found in storage-contracts-v1/contracts/StorageContract.sol at f1c9c17ef16b59c0495388672f11797eeec7848a (ethstorage/storage-contracts-v1 on GitHub).

We take into account not only the random sampling costs but all costs, including electricity and equipment amortization. So here we consider the case where the attacker's costs are lower than the reference costs.

Yes. And it exactly equals n \tau.

I think sampling is not tied to the storage device, because the CPU miner can simply keep track of the fact that everything with offset 0.9 and above is not stored, and brute-force a suitable nonce (with no storage utilization at this step). \chi in my note relates to the storage device (in the shrink attack) and to the storage together with the CPU (in the Moore attack).

In case you have no throughput limits per miner (= reward is a linear function on throughput), the best strategy is:

  • store all data honestly
  • The bottleneck is the bus. So the best miner is a cluster of small devices, for example with 8GB DDR5 RAM each. The cluster can compute hashes in parallel and saturate the bus with the cheapest devices that have a lot of throughput (minimal storage does not matter; just buy more devices to store everything). Also, a GPU may be more efficient than RAM, but I have not checked it.

So the issue in this case is not that miners will forget something, but the use of a cluster of tiny devices, optimized for throughput but not for storage, which increases the price of storage for end users.

Limiting the nonce is not limiting the throughput; it is just a scale recalibration. By limiting the throughput I mean implementing a scheme under which two x GB/s miners receive more reward than one 2x GB/s miner. If you do not limit the throughput, the only issue is a RAM or GPU cluster, which is suboptimal for storage but will still maintain 100% of the data.

64 RAM modules have 64 times more throughput than a single module because they will be maintained on 64 separate tiny machines.

I am not sure what the definition of scale recalibration is. In fact, the rate-limiting approach in EthStorage will allow up to 1M (2^{20}) nonces per Ethereum block. Given n = 2 and a sampling size of 4KB, this essentially limits the throughput of a device to 1M * n * 4KB = 8GB per 12s (the Ethereum block time). That said, any storage device with a throughput of 8GB/12s \approx 682MB/s can achieve the full hash rate; however, a device with a higher rate (e.g., memory) will be capped at this rate. (Unless the device can encode the data at 682MB/s at a much lower cost compared to storage; such an attack can be addressed by using an expensive SNARK-friendly encoding function.)
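For reference, the arithmetic behind these figures (a sketch using only the numbers quoted in this reply):

```python
# Back-of-the-envelope check of the rate limit described above: 2^20 nonces per
# block, n = 2 samples per nonce, 4KB samples, 12s Ethereum block time.
nonces_per_block = 2**20
samples_per_nonce = 2
sample_size_bytes = 4 * 1024
block_time_s = 12

bytes_per_block = nonces_per_block * samples_per_nonce * sample_size_bytes
print(bytes_per_block / 2**30, "GiB per block")         # 8.0 (the "8GB" figure)
print(bytes_per_block / block_time_s / 2**20, "MiB/s")  # ~682.7 (the "682MB/s" figure)
```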

I guess we need to clarify the definition here. Here, a successful random-sampling candidate is computed by fully sampling n random locations, computing the sampled data, and returning a hash that meets the difficulty parameter. If any random location is not stored by the node, we cannot get such a candidate and the previous random accesses (samplings) will be wasted.

The cost per candidate will contain both amortized device cost (both storage device and sampling device) and the electricity cost per candidate. This is similar to Bitcoin mining.

For a small nonce limit (for example, 2^{20} per Ethereum block), the sampling device is not a bottleneck, and the cost of \sim100 KH/s will be negligible compared to the storage costs. If in some case we find that the sampling has failed, we just break the process for this nonce and do not consume the throughput.

I see, you limit the number of samples per block by the nonce at the Solidity contract level. I have not found this constraint in the paper, but it changes the algorithm, making it more Filecoin-like.

If we limit nonce by m with difficulty d, the probability to get a reward per block will be:

p_r=1 - (1-p_s/d)^m \approx 1 - \exp(-p_s m /d)

To make any q>0 economically inefficient, we need \partial_q p_r |_{q=0} < -1 .

\partial_q p_r |_{q=0} = -n \cdot m/d \cdot \exp(-m/d) < -1.

So, \theta \exp(-\theta) > 1/n, where \theta = m/d

[figure: plot of \theta \exp(-\theta)]

If \theta \sim 1 and n>10, it’s enough to make q>0 economically inefficient.

But there is no lower limit on throughput: if m is small enough, HDD mining is possible, and it is more like Filecoin, not Bitcoin.

No matter whether it is like the challenge-response protocol of Filecoin or the PoRA of Arweave, the key goal is to achieve

an efficient on-chain distributor to reward the storage nodes by

\frac{\text{the number of replicas of the node}}{\text{total number of replicas in the network}}

where the replicas can be dynamic over time, and the data in the replicas can be slowly changed.

For this goal, the challenge-response protocol of Filecoin has

  • a reward for each replica of exactly 1/r per challenge-response window (30 min), where r is the number of replicas in the network
  • a communication cost of O(r) per challenge-response window
  • a minimum change unit of 32GB (the sector size in Filecoin), which is unfriendly for dynamic storage (e.g., a KV-store) with a much smaller value size

The design of EthStorage uses limited throughput of PoRA or data-availability sampling over time (DASoT) such that

  • each replica is rewarded 1/r statistically over time
  • the communication cost is O(1) per target proof interval (e.g., about 3 hours on our testnet) no matter what r is
  • the minimum change unit is 128KB (the EIP-4844 blob size), which is friendly for dynamic storage like a KV-store.

So I would say the limited throughput of PoRA harvests the benefits of both the challenge-response protocol of Filecoin (exact per-replica reward) and PoRA (efficient O(1) communication cost). Note that the latest version of Arweave also uses a limited throughput of PoRA (see).

I have not quite followed the analysis here. In our design, m/d \ll 1 with m = 2^{20}, and d \approx r * 900 * 2^{20}, where 900 * 12s = 3 hours is the target proof interval. In our testnet on Ethereum Sepolia, d = 120\text{e}9, where r \approx 127.
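A quick check of these numbers (a sketch using only the values quoted here):

```python
# Quick check of the quoted testnet figures.
m = 2**20                   # nonces per block
r = 127                     # replicas
blocks_per_interval = 900   # 900 * 12s = 3 hours
d = r * blocks_per_interval * 2**20

print(f"d = {d:,}")          # ~1.2e11, i.e. the quoted 120e9
print(f"m/d = {m / d:.1e}")  # ~8.7e-6, so m/d << 1
```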

HDD is possible with a proper access pattern and data organization on a disk (e.g., using a random offset of a sequential read). Further, we may allow the algorithm to support detecting both HDD and SSD storage and distribute different rewards to them.

There was a mistake. Here is the correct analysis:

Let the node store a fraction p of the data, and let q = 1 - p be the missing part.
Then the probability of picking all n samples from the stored part is p_s = p^n \approx \exp(-qn).
Let d be the difficulty. Then the probability of solving the difficulty is p_d = 1/d.
The probability of not getting a successful mine from a single sampling is 1 - p_s \cdot p_d, because p_s and p_d are independent.
Then the probability of not getting a successful mine across m nonces is (1 - p_s \cdot p_d)^m \approx \exp(-m p_s p_d).
The probability of getting at least one success, and hence the reward for the block, is p_r = 1 - \exp(-m p_s p_d).

Let's set the mathematical expectation of the reward at q=0 to 1. Let's neglect the cost of computation because m is small. Then economic equilibrium requires the maintenance cost of the storage for one block to also be 1.

Now let's consider what happens when q>0. The maintenance cost is 1-q, and the mathematical expectation of the reward is R=p_r/(p_r|_{q=0}).

To make q>0 economically inefficient, we need

\partial_q R |_{q=0} < \partial_q (1-q) |_{q=0} = -1.

Then the reward will decrease faster than the benefits from partial maintenance.

\partial_q R |_{q=0} = (m/d \cdot \exp(-m/d \cdot p_s) \cdot \partial_q p_s)|_{q=0} / p_r|_{q=0} = -m/d \cdot n \cdot \exp(-m/d)/(1-\exp(-m/d)) < -1.

\theta \exp(-\theta)/(1-\exp(-\theta)) > 1/n

[figure: plot of \theta \exp(-\theta)/(1-\exp(-\theta))]

\theta \ll 1 is also safe. Only a large \theta, which means that the miner can select one of multiple possible mineable samples, leads to a vulnerability.
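A minimal sketch of this condition evaluated at a few illustrative values of \theta and n (the values themselves are assumptions of the example):

```python
import math

# Dropping any data is unprofitable whenever
# theta * exp(-theta) / (1 - exp(-theta)) > 1/n, with theta = m/d.
def dropping_is_unprofitable(theta: float, n: int) -> bool:
    lhs = theta * math.exp(-theta) / (1 - math.exp(-theta))
    return lhs > 1 / n

print(dropping_is_unprofitable(theta=1.0, n=10))     # True: theta ~ 1 is safe for n > 10
print(dropping_is_unprofitable(theta=8.7e-6, n=2))   # True: small theta is safe too
print(dropping_is_unprofitable(theta=10.0, n=10))    # False: large theta is the risky regime
```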

So, limited throughput (by nonces per challenge) PoRA is safe. My first message here was about the original paper.

zkSNARKs can also solve this issue. What do you think about the following alternative:

  1. Rare challenges
  2. Each challenge must be answered, or the miner will be slashed

The issue is how to force miners to maintain the infrastructure during market stress. For example, if the protocol token price falls 10x within a day, it should not lead to a death spiral in which miners, having lost their short-term economic incentive, instantly switch to competitors and damage the network.

Slashing works like a dollar auction in this case.

That is why we use ETH as the protocol token, so that a sudden price drop should not be a problem.

In the long term, any significant price drop would hurt the system, even for BTC / ETH.

One main cost of challenge-response is the O(r) communication cost, which does not scale well if r is large or if the on-chain verification is expensive (e.g., when Ethereum is congested).