Different units — each names a different bottleneck.
QPS, RPS, IOPS, BPS: different units
make the invisible limits visible
Bigger RAM raises page-cache hit rate.
Throughput is measured in different units depending on which layer you mean. QPS/RPS at the app, IOPS at storage, BPS at the network. Scaling problems usually live in one unit but get described in another — clarity matters.
QPS/RPS: queries or requests per second at the app or DB. Limited by CPU, locks, or downstream.
IOPS: storage operations per second; commodity SSDs ~10–100k. Many DB problems are really IOPS problems.
BPS: bytes per second; matters for media, analytics, replication.
Always derive: e.g. 1k RPS × 5 reads/req × 10 IOPS/read ≈ 50k IOPS.
API hits 5k RPS, DB CPU 30%, but slow.