Solidity – Why Does Gas Cost of Array Push Operation Remain the Same?

arrays, solidity

I'm doing a course project for a blockchain class in which I test the complexity of a data structure in terms of gas cost. I noticed that the gas cost of an array push in Solidity stays the same. If we have a dynamic array, there must be a point where the length of the array is doubled and the data is copied; at that point the gas cost should be larger. How can Solidity make the cost of push constant? Does Solidity allocate a very large array at initialization so that the push cost can be constant?

Best Answer

Pushing can only be done on resizable arrays, which live in storage.

Now, storage is handled quite differently from a heap, as in C for example. This is mostly because it is fully addressable (all 2^256 slots of 32 bytes each): you won't run short of "allocated" storage in Solidity; you'd most certainly run out of money way before...

So:

If we have a dynamic array, there must be a point where the length of the array is doubled and the data is copied

Not necessarily, no. This scheme is a heuristic used to minimize memory allocation, which is costly, and to optimize memory usage, which is actually scarce, but you could rely on a different one. You seem to state it as a rule, which it is not. Now, to understand what exactly happens in Solidity when you use .push(), you need to understand the storage layout of dynamic arrays as explained here, and more specifically the scale of the storage space.
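To make the doubling scheme concrete: Solidity's memory arrays have no built-in push, so if you wanted one there, you would have to implement something like that heuristic yourself. Here is a minimal sketch (the MemoryVector name and its API are mine, purely for illustration):

// A hand-rolled, grow-by-doubling push for memory arrays, sketching the
// classic heuristic the question describes. Purely illustrative.
library MemoryVector {
  function push(uint256[] memory buf, uint256 len, uint256 value)
    internal pure returns (uint256[] memory, uint256)
  {
    if (len == buf.length) {
      // Capacity exhausted: allocate double the space and copy everything.
      // This copy is the occasional "expensive push" of the doubling scheme.
      uint256[] memory bigger = new uint256[](buf.length == 0 ? 1 : 2 * buf.length);
      for (uint256 i = 0; i < len; i++) {
        bigger[i] = buf[i];
      }
      buf = bigger;
    }
    buf[len] = value;
    return (buf, len + 1);
  }
}

Storage arrays never need this dance, as we'll see below.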

Let's use an example contract such as this one:

// SPDX-License-Identifier: MIT
pragma solidity ^0.8.0;

contract Example {

  uint256[] myArray; // first state variable -> storage slot 0

  function addElement(uint256 value) public {
    myArray.push(value);
  }
}

The dynamic storage array myArray is declared at storage slot 0 (simply because it is the first state variable); this slot contains the length of the array. Its data is stored starting at keccak256(0).

In order, the elements of the array will be stored at:

  • Index 0: keccak256(0)
  • Index 1: keccak256(0) + 1
  • Index 2: keccak256(0) + 2
  • ...
  • Index n: keccak256(0) + n
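You can check this layout yourself. Here is a small sketch (the contract and function names are mine) that compares a normal indexed read with a raw sload of keccak256(0) + n:

// SPDX-License-Identifier: MIT
pragma solidity ^0.8.0;

contract LayoutCheck {

  uint256[] myArray; // first state variable -> storage slot 0

  function check(uint256 n) public returns (uint256 viaIndex, uint256 viaSlot) {
    myArray.push(10);
    myArray.push(20);
    myArray.push(30);
    viaIndex = myArray[n]; // normal indexed access (n < 3 here)
    // Raw access: element n lives at keccak256(0) + n,
    // where the slot number 0 is hashed as a 32-byte value.
    uint256 slot = uint256(keccak256(abi.encode(uint256(0)))) + n;
    assembly {
      viaSlot := sload(slot)
    }
    // viaIndex and viaSlot are equal.
  }
}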

The important bit is that keccak256(x) can be interpreted as a 256-bit address in a memory space of 2^256 slots of 32 bytes each. That's huge! (2^261 bytes.) We are far beyond the megabytes or gigabytes of physical memory whose scarcity actually requires a memory allocator.

To put things into perspective, 2^261 bytes of "available" memory is enough for 387973554649592116040087854974761818637655473 YiB (2^80 bytes each) for every human currently living on Earth. And that's the limit for a single account's storage... The practical limit is of course far lower.


I'm actually amazed by this. I used the code below to compute it; tell me if you spot a mistake:

console.log(
  web3.utils
    .toBN("2")
    .pow(web3.utils.toBN("261"))
    .div(web3.utils.toBN("1024")) // To KiB
    .div(web3.utils.toBN("1024")) // To MiB
    .div(web3.utils.toBN("1024")) // To GiB
    .div(web3.utils.toBN("1024")) // To TiB
    .div(web3.utils.toBN("1024")) // To PiB
    .div(web3.utils.toBN("1024")) // To EiB
    .div(web3.utils.toBN("1024")) // To ZiB
    .div(web3.utils.toBN("1024")) // To YiB
    .div(web3.utils.toBN("7900000000")) // Divide by world population
    .toString()
);

In this context, the decision was made to rely on the extremely low probability of collisions between slots rather than implement a proper memory-allocation scheme, which would be quite costly in gas and useless virtually all (99.9999999...%) of the time.

How can Solidity make the cost of push constant?

Simply by writing to the next free slot, keccak256(arraySlot) + arrayLength, and then incrementing the length stored at arraySlot. No data is ever moved or copied.
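Schematically, a push boils down to something like the following inline-assembly sketch (hand-written to show the idea, not the literal compiler output):

// SPDX-License-Identifier: MIT
pragma solidity ^0.8.0;

contract PushSketch {

  uint256[] myArray; // length in slot 0, data from keccak256(0) onwards

  // Roughly what `myArray.push(value)` does under the hood.
  function push(uint256 value) public {
    assembly {
      let len := sload(myArray.slot)         // read the current length
      mstore(0x00, myArray.slot)             // hash the slot number...
      let dataStart := keccak256(0x00, 0x20) // ...to find where the data begins
      sstore(add(dataStart, len), value)     // write the element at the next slot
      sstore(myArray.slot, add(len, 1))      // bump the length
    }
  }
}

Two sstores and one sload, whatever the current length: no copying, no reallocation, hence the constant cost you measured.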

Does Solidity allocate a very large array at initialization so that the push cost can be constant?

If you want to see it that way, yes... It's more that sparse data (values, structs, or arrays, for example) sits in a really, really huge memory space. The whole storage space is addressable, but only what is actually written to it is really stored, in the form of key/value pairs.
