Is Argon2 better than Bcrypt?

Is Argon2 actually better than Bcrypt?

This blog post was published on March 22, 2026.

The first rule of user passwords is never store them as plain text. The second rule is to use Bcrypt. Argon2 is a newer algorithm that gets recommended over Bcrypt, but the main principle is still the same: use a slow hashing algorithm that's designed for passwords.

But, why is Argon2 better than Bcrypt? On the surface, Argon2 has less foot-guns that Bcrypt. Bcrypt has a maximum password length of 72 bytes and that 72 bytes needs to be a null-terminated string. Argon2 on the other hand doesn't have a password length limit and works with any binary data. I think this alone is a good enough reason to use Argon2 over Bcrypt. Still, newer algorithms aren't always more secure. Is it possible that Argon2 is weaker than Bcrypt?

Password hashing generally needs to complete Password cracking is mostly done with GPUs these days because they're much better at hashing than CPUs. Hashing is just a series of small computation and the GPU's many tiny compute units excel at these kind of work. The table below shows hashing speeds of SHA-256, a fast and simple hashing algorithm, between different hardware. The CPU benchmarks were ran using Go's crypto/sha2 standard library package and GPU benchmarks are from publicly available benchmarks. The device cost is either the rough second-hand market price or the retail price, whichever is cheaper.

Note that all GPU benchmarks referenced in this blog post were collected from the internet and published by different users. I highly doubt that the data is outright fake but it is still possible that some data is inaccurate.

SHA-256 hashing performance

Device Hashing speed (hashes per second) Cost (hashes per second per dollar) Power efficiency (hashes per second per watt)

Hetzner CCX23 (CPU) 40,365,462

MacBook Air M1 8 cores (CPU, 2020) 90,800,749 302,669 3,026,691

MacBook Pro M3 Pro 11 cores (CPU, 2023) 171,638,888 156,177 5,721,296

GTX 1080 (GPU, 2016) 2,439,500,000 16,263,333 13,552,777

RTX 5090 (GPU, 2025) 28,353,300,000 14,183,741 48,885,000

The CPU numbers can be improved further but we see that GPUs are 50 to 100 times faster and 5 to 10 times more efficient.

The design of Argon2 attempts to close this gap by using a significant amount of memory. Argon2 can be configured to use anywhere from a few kilobytes to gigabytes of memory per hash. While GPUs are fast at pure computation, its memory (VRAM) has a limit on how fast it can transfer data (memory bandwidth) like regular RAM. Argon2's massive memory usage bottlenecks the GPU's memory bandwidth and puts a hard limit on how fast the GPU can calculate hashes. The faster L1 and L2 cache aren't useful either as these aren't large enough to handle multiple processes at once. Additionally, once you set the memory configuration high enough, the GPU quickly runs out of memory for more processes and has to put many of its compute units in idle.

Argon2 has 3 parameters: memory size, iteration count, and degree of parallelism. For a memory size m bytes and iteration count t with parallelism set to 1 (single thread), the total number of bytes read and written to memory is calculated by:

(3 × t - 1) × m

For example, Argon2 at 16 mebibytes and 3 iterations will read and write approximately 128 mebibytes. Argon2 also has 3 variations (Argon2i, Argon2d, Argon2id) to be exact but they have similar performance when using the same parameters so I'll be grouping them as the same algorithm.

The table below shows the estimated hashing speed based on the GPU's bandwidth and the actual hashing speeds at different memory configuration for the Nvidia GTX 1080 (2016) and Nvidia RTX 5090 (2025) GPU. The actual hashing speed are from benchmarks ran with John the Ripper, an open-source password cracking tool.

Memory size Bandwidth-estimated hashing speed (hashes per second) Actual hashing speed (hashes per second)

16 MiB 2,441 1,742

64 MiB 610 387

256 MiB 153 34

Argon2 with varying memory configuration at 3 iterations and 1 degree of parallelism on a GTX 1080

Memory size Bandwidth-estimated hashing speed (hashes per second) Actual hashing speed (hashes per second)

16 MiB 13,672 11,465

64 MiB 3,418 2,400

256 MiB 854 178

Argon2 with varying memory configuration at 3 iterations and 1 degree of parallelism on an RTX 5090

On both the GTX 1080 and RTX 5090, the benchmark numbers generally align with our estimate at 16 mebibytes and 64 mebibytes of memory. As expected, the GPUs also see a linear slowdown as it needs to read and write more bytes. However, the GPU experiences a quadratic slowdown when the memory parameter is increased to 256 mebibytes from 64 mebibytes. This indicates that both GPUs are limited by its memory size at around 64 mebibytes. After this point, the GPU has to do twice the work with half the compute units when the memory requirement doubles. I'm unsure why the GTX 1080 sees lower...

Is Argon2 better than Bcrypt?

Related Articles

(no title)

Is AI ruining our skills? Early results are in – and they're not good

The Anatomy of an AI-Native Org

ZCode – Harness for GLM-5.2

Apertus – Open Foundation Model for Sovereign AI