scrypt

In cryptography, scrypt is a password-based key derivation function created by Colin Percival, originally for the Tarsnap online backup service.^[1] The algorithm was specifically designed to make large-scale custom hardware attacks costly by requiring large amounts of memory. In 2012, the scrypt algorithm was published by the IETF as an Internet Draft, intended to become an informational RFC.^[2] A simplified version of scrypt is used as a proof-of-work scheme by a number of cryptocurrencies, including Litecoin.^[3]

Introduction

A password-based key derivation function (password-based KDF) is generally designed to be computationally intensive, so that it takes a relatively long time to compute (say on the order of several hundred milliseconds). Legitimate users only need to perform the function once per operation (e.g., authentication), and so the time required is negligible. However, a brute-force attack would likely need to perform the operation billions of times, at which point the time requirements become significant and, ideally, prohibitive.

Previous password-based KDFs (such as the popular PBKDF2 from RSA Laboratories) have relatively low resource demands, meaning they do not require elaborate hardware or very much memory to perform. They are therefore easily and cheaply implemented in hardware (for instance on an ASIC or even an FPGA). This allows an attacker with sufficient resources to launch a large-scale parallel attack by building hundreds or even thousands of implementations of the algorithm in hardware and having each search a different subset of the key space. This divides the amount of time needed to complete a brute-force attack by the number of implementations available, very possibly bringing it down to a reasonable time frame.
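The effect of parallel hardware on a brute-force search can be illustrated with simple arithmetic. The sketch below is not part of scrypt; the function name and the figures used (one billion guesses, 0.1 seconds per guess, 1000 hardware units) are hypothetical values chosen only for illustration.

```python
SECONDS_PER_DAY = 86_400


def brute_force_days(guesses, seconds_per_guess, parallel_units=1):
    """Days needed to try `guesses` candidates spread over `parallel_units` devices."""
    total_seconds = guesses * seconds_per_guess / parallel_units
    return total_seconds / SECONDS_PER_DAY


# Hypothetical figures: 10**9 guesses at 0.1 s per KDF evaluation.
single = brute_force_days(10**9, 0.1)        # one implementation
farm = brute_force_days(10**9, 0.1, 1000)    # a thousand implementations in parallel
print(f"{single:.0f} days alone, {farm:.2f} days with 1000 units")
```

With these assumed numbers a single implementation needs more than three years, while a thousand cheap implementations finish in a little over a day, which is why the cost of each implementation matters.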

The scrypt function is designed to hinder such attempts by raising the resource demands of the algorithm. Specifically, the algorithm is designed to use a large amount of memory compared to other password-based KDFs,^[4] making a hardware implementation much larger and more expensive, and therefore limiting the amount of parallelism an attacker can use for a given amount of financial resources.

Overview

The large memory requirements of scrypt come from a large vector of pseudorandom bit strings that is generated as part of the algorithm. Once the vector is generated, its elements are accessed in a pseudorandom order and combined to produce the derived key. A straightforward implementation would need to keep the entire vector in random-access memory so that it can be accessed as needed.

Because the elements of the vector are generated algorithmically, each element could be generated on the fly as needed, only storing one element in memory at a time and therefore cutting the memory requirements significantly. However, the generation of each element is intended to be computationally expensive, and the elements are expected to be accessed many times throughout the execution of the function. Thus there is a significant trade-off in speed in order to get rid of the large memory requirements.

This sort of time–memory trade-off often exists in computer algorithms: speed can be increased at the cost of using more memory, or memory requirements can be decreased at the cost of performing more operations and taking longer. The idea behind scrypt is to deliberately make this trade-off costly in either direction. Thus an attacker could use an implementation that does not require many resources (and can therefore be massively parallelized with limited expense) but runs very slowly, or use an implementation that runs more quickly but has very large memory requirements and is therefore more expensive to parallelize.
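The trade-off can be made concrete with a toy model. The sketch below is not scrypt itself: it uses SHA-256 as a stand-in for the expensive element-generation step, and the function names are hypothetical. It derives the same value twice, once with the whole vector kept in memory and once recomputing each element on demand, and counts how many times the mixing function is called.

```python
import hashlib


def mix(x: bytes) -> bytes:
    """Stand-in for the expensive generation step (NOT scrypt's real mixing function)."""
    return hashlib.sha256(x).digest()


def index(x: bytes, n: int) -> int:
    """Pick a pseudorandom position in the vector from the current state."""
    return int.from_bytes(x[:8], "little") % n


def xor(a: bytes, b: bytes) -> bytes:
    return bytes(p ^ q for p, q in zip(a, b))


def derive_with_table(seed: bytes, n: int):
    """Keep all n elements in memory; costs exactly 2n calls to mix()."""
    calls, table, x = 0, [], seed
    for _ in range(n):
        table.append(x)
        x = mix(x)
        calls += 1
    for _ in range(n):
        x = mix(xor(x, table[index(x, n)]))
        calls += 1
    return x, calls


def derive_without_table(seed: bytes, n: int):
    """Keep one element at a time; every access regenerates the element from the seed."""
    calls, x = 0, seed
    for _ in range(n):
        x = mix(x)
        calls += 1
    for _ in range(n):
        v = seed
        for _ in range(index(x, n)):  # recompute the needed element from scratch
            v = mix(v)
            calls += 1
        x = mix(xor(x, v))
        calls += 1
    return x, calls


seed = mix(b"example passphrase")  # 32-octet starting state
fast, fast_calls = derive_with_table(seed, 256)
slow, slow_calls = derive_without_table(seed, 256)
print(fast == slow, fast_calls, slow_calls)
```

Both variants produce the same output, but the low-memory variant performs on the order of n²/2 extra mixing calls, which is the cost an attacker pays for avoiding the memory.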

Algorithm

The algorithm includes the following parameters:

MFLen - Length of the block mixed by SMix(), in octets; MFLen = 128 * r.
r - Block size parameter; a positive integer.
hLen - Length of the output produced by HMAC_SHA256(), in octets (32).
dkLen - Intended output length in octets of the derived key; a positive integer satisfying dkLen ≤ (2³² − 1) * hLen.
N - CPU/memory cost parameter; an integer power of two greater than 1.
p - Parallelization parameter; a positive integer satisfying p ≤ (2³² − 1) * hLen / MFLen.
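The constraints in this list, and the size of the vector that SMix() builds, can be checked with a few lines of arithmetic. The following is a minimal sketch; the function names are hypothetical, and the example values (N = 2¹⁴, MFLen = 1024) are assumptions chosen for illustration, not values prescribed by the text.

```python
H_LEN = 32  # octets produced by HMAC-SHA256, as in the parameter list


def check_parameters(p: int, mf_len: int, dk_len: int) -> None:
    """Raise ValueError if dkLen or p violates the bounds in the parameter list."""
    limit = (2**32 - 1) * H_LEN
    if not 1 <= dk_len <= limit:
        raise ValueError("dkLen must satisfy 1 <= dkLen <= (2^32 - 1) * hLen")
    # p <= limit / MFLen, written without division to stay in exact integers
    if p < 1 or p * mf_len > limit:
        raise ValueError("p must satisfy 1 <= p <= (2^32 - 1) * hLen / MFLen")


def vector_octets(n: int, mf_len: int) -> int:
    """Memory held by the vector V in one SMix call: N blocks of MFLen octets."""
    return n * mf_len


check_parameters(p=1, mf_len=1024, dk_len=64)
print(vector_octets(2**14, 1024) // 2**20, "MiB per SMix call")
```

Because each of the p blocks is processed by its own SMix call, an implementation that runs those calls concurrently needs roughly p times this amount.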

Function scrypt(Passphrase,Salt,N,p,dkLen):

(B₀ ... B_p−1) ← PBKDF2_{HMAC_SHA256}(Passphrase, Salt, 1, p * MFLen)
for i = 0 to p-1 do
    B_i ← SMix(B_i,N)
end for
Output ← PBKDF2_{HMAC_SHA256}(Passphrase, B₀ || B₁ || ... || B_p−1, 1, dkLen)
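In practice the function is usually called through a library rather than implemented by hand. The sketch below uses `hashlib.scrypt` from the Python standard library, which is available when Python is built against OpenSSL 1.1 or later. The wrapper name `derive_key` and the cost values (n = 2¹⁴, r = 8, p = 1) are assumptions for illustration; `r` is the block size parameter of the standardized algorithm, with MFLen = 128 * r.

```python
import hashlib
import os


def derive_key(passphrase: bytes, salt: bytes, dk_len: int = 32) -> bytes:
    """Derive dk_len octets from a passphrase with illustrative cost settings."""
    return hashlib.scrypt(
        passphrase,
        salt=salt,
        n=2**14,                   # CPU/memory cost parameter N
        r=8,                       # block size parameter (MFLen = 128 * r = 1024)
        p=1,                       # parallelization parameter
        maxmem=64 * 1024 * 1024,   # upper bound on memory the call may use
        dklen=dk_len,
    )


salt = os.urandom(16)  # a fresh random salt per passphrase
key = derive_key(b"correct horse battery staple", salt)
print(len(key), key.hex())
```

The salt must be stored alongside the derived key, since the same passphrase and salt are needed to reproduce it.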

Function SMix(B,N):

X ← B
for i = 0 to N − 1 do
    V_i ← X
    X ← BlockMix(X)
end for
for i = 0 to N − 1 do
    j ← Integerify(X) mod N
    X ← BlockMix(X ⊕ V_j)
end for
Output ← X

Integerify() is a bijective function from {0, 1}^k to {0, ..., 2^k − 1}.
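The two loops of SMix translate almost line for line into code. In the sketch below the mixing function is passed in as a parameter, and the demonstration uses a SHA-512 stand-in (`toy_block_mix`, a hypothetical helper) rather than the real BlockMix. Integerify is taken to be the little-endian reading of the last 64 octets of the block, which is how the IETF specification of scrypt defines it.

```python
import hashlib
from typing import Callable


def integerify(block: bytes) -> int:
    """Interpret the last 64 octets of the block as a little-endian integer."""
    return int.from_bytes(block[-64:], "little")


def xor_bytes(a: bytes, b: bytes) -> bytes:
    return bytes(x ^ y for x, y in zip(a, b))


def smix(block: bytes, n: int, block_mix: Callable[[bytes], bytes]) -> bytes:
    """Fill a vector of n blocks, then revisit it in a data-dependent order."""
    x = block
    v = []
    for _ in range(n):            # first loop: V_i <- X, X <- BlockMix(X)
        v.append(x)
        x = block_mix(x)
    for _ in range(n):            # second loop: pseudorandom reads from V
        j = integerify(x) % n
        x = block_mix(xor_bytes(x, v[j]))
    return x


def toy_block_mix(block: bytes) -> bytes:
    """Stand-in mixing function for the demo only (NOT scrypt's BlockMix)."""
    return hashlib.sha512(block).digest()


print(smix(bytes(64), 16, toy_block_mix).hex())
```

Since the index j depends on the current state X, the order of reads cannot be predicted before the first loop has finished, which is what forces the vector to be kept or recomputed.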

Function BlockMix(B):

(B₀, ... , B_2r−1) ← B
X ← B_2r−1
for i = 0 to 2r − 1 do
    X ← H(X ⊕ B_i)
    Y_i ← X
end for
Output ← (Y₀, Y₂, ... , Y_2r−2, Y₁, Y₃, ... , Y_2r−1)

Here H is the Salsa20/8 core function, and B is split into 2r chunks B_i of 64 octets each.
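BlockMix can be sketched in Python once a concrete H is chosen. The code below assumes H is the Salsa20/8 core (the Salsa20 core reduced to 8 rounds, operating on sixteen little-endian 32-bit words), as in the IETF specification of scrypt; the helper names are hypothetical and the code favours readability over speed.

```python
import struct

MASK32 = 0xFFFFFFFF

# Index quadruples (a, b, c, d) for one Salsa20 double round.
_QUARTER_ROUNDS = [
    (0, 4, 8, 12), (5, 9, 13, 1), (10, 14, 2, 6), (15, 3, 7, 11),   # columns
    (0, 1, 2, 3), (5, 6, 7, 4), (10, 11, 8, 9), (15, 12, 13, 14),   # rows
]


def _rotl(value: int, shift: int) -> int:
    """Rotate a 32-bit value left by `shift` bits."""
    value &= MASK32
    return ((value << shift) | (value >> (32 - shift))) & MASK32


def salsa20_8(block: bytes) -> bytes:
    """Salsa20/8 core: 4 double rounds on a 64-octet block, then a feed-forward add."""
    words = list(struct.unpack("<16I", block))
    x = words[:]
    for _ in range(4):
        for a, b, c, d in _QUARTER_ROUNDS:
            x[b] ^= _rotl(x[a] + x[d], 7)
            x[c] ^= _rotl(x[b] + x[a], 9)
            x[d] ^= _rotl(x[c] + x[b], 13)
            x[a] ^= _rotl(x[d] + x[c], 18)
    return struct.pack("<16I", *((x[i] + words[i]) & MASK32 for i in range(16)))


def block_mix(block: bytes) -> bytes:
    """BlockMix over 2r chunks of 64 octets, with H = Salsa20/8."""
    if len(block) == 0 or len(block) % 128 != 0:
        raise ValueError("block length must be a positive multiple of 128 octets")
    chunks = [block[i:i + 64] for i in range(0, len(block), 64)]
    x = chunks[-1]                               # X <- B_{2r-1}
    y = []
    for chunk in chunks:
        x = salsa20_8(bytes(p ^ q for p, q in zip(x, chunk)))
        y.append(x)
    return b"".join(y[0::2] + y[1::2])           # even-indexed outputs first, then odd


print(block_mix(bytes(range(128))).hex())
```

Combined with the SMix loops and the two PBKDF2 calls shown earlier, this function can be compared against a library implementation such as `hashlib.scrypt` to check that the pieces fit together.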

Proof-of-work in cryptocurrency operations

Scrypt has been used in many cryptocurrencies since Tenebrix first implemented it as an alternative proof-of-work algorithm in September 2011.^[5] Mining of cryptocurrencies that use scrypt as a proof-of-work function is often performed on graphics processing units (GPUs), since GPUs tend to have significantly more processing power than CPUs.^[6] The rising price of these currencies led to shortages of high-end GPUs in November and December 2013.^[7]

As of May 2014, specialized ASIC mining hardware is available for scrypt-based cryptocurrencies.^[8]

References

[1] "scrypt page on the Tarsnap website". Retrieved 21 January 2014.
[2] C. Percival, S. Josefsson (17 September 2012). "The scrypt Password-Based Key Derivation Function". IETF.
[3] Alec Liu. "Beyond Bitcoin: A Guide to the Most Promising Cryptocurrencies".
[4] Colin Percival. "Stronger Key Derivation Via Sequential Memory-Hard Functions".
[5] "History of cryptocurrency". Retrieved 27 June 2014.
[6] Roman Guelfi-Gibbs. "Litecoin Scrypt Mining Configurations for Radeon 7950". Amazon Digital Services.
[7] Joel Hruska (10 December 2013). "Massive surge in Litecoin mining leads to graphics card shortage". ExtremeTech.
[8] Caleb Chen (21 May 2014). "Zeusminer Delivers Lightning, Thunder, and Cyclone Scrypt ASICs For Litecoin And Dogecoin Mining".
