
Monday 1 March 2021

Computer latency scaled

We all love to talk about architecture - microservices, message passing, event streams, SOA, etc. It all boils down to the numbers below.

Can you fit your working set in the cache, or do you regularly fall back to main memory? What is the penalty for calling another service - can it ever be faster than accessing the disk? You should be able to answer these questions, at least roughly, for your application.
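To make the cache question concrete, here is a minimal sketch (class name and sizes are my own, not from the post): summing the same 2D array row-by-row (sequential, cache-friendly) versus column-by-column (strided, cache-hostile). Same arithmetic, very different speed once the array outgrows the caches.

```java
public class CacheDemo {

    // Walks memory sequentially: each cache line fetched is fully used.
    static long sumRowMajor(int[][] a) {
        long sum = 0;
        for (int i = 0; i < a.length; i++)
            for (int j = 0; j < a[i].length; j++)
                sum += a[i][j];
        return sum;
    }

    // Jumps a whole row's worth of memory per access: frequent cache misses.
    static long sumColMajor(int[][] a) {
        long sum = 0;
        for (int j = 0; j < a[0].length; j++)
            for (int i = 0; i < a.length; i++)
                sum += a[i][j];
        return sum;
    }

    public static void main(String[] args) {
        int n = 2048;                       // ~16 MB of ints: bigger than most L3 caches
        int[][] a = new int[n][n];
        for (int[] row : a)
            java.util.Arrays.fill(row, 1);

        long t0 = System.nanoTime();
        long s1 = sumRowMajor(a);
        long t1 = System.nanoTime();
        long s2 = sumColMajor(a);
        long t2 = System.nanoTime();

        System.out.println("row-major: " + (t1 - t0) / 1_000_000 + " ms (sum " + s1 + ")");
        System.out.println("col-major: " + (t2 - t1) / 1_000_000 + " ms (sum " + s2 + ")");
    }
}
```

This is a rough illustration, not a rigorous benchmark (no warm-up, no JIT control); the point is only that identical work can differ by a large factor depending on whether you stream through memory or jump around in it.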

If you work with computers, these numbers are worth knowing:

p.s. This is also why you should prefer an ArrayList over a LinkedList most days - sequential, cache-friendly memory access beats pointer chasing.
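A minimal illustration of the ArrayList point (my own sketch, not a rigorous JMH benchmark): indexed access over both list types. ArrayList backs onto a contiguous array, so get(i) is one array read; LinkedList.get(i) walks node pointers from the nearer end on every call, and those nodes are scattered across the heap.

```java
import java.util.ArrayList;
import java.util.LinkedList;
import java.util.List;

public class ListDemo {

    // O(1) per get() for ArrayList; O(n) per get() for LinkedList.
    static long sumByIndex(List<Integer> list) {
        long sum = 0;
        for (int i = 0; i < list.size(); i++)
            sum += list.get(i);
        return sum;
    }

    public static void main(String[] args) {
        int n = 100_000;
        List<Integer> array = new ArrayList<>();
        List<Integer> linked = new LinkedList<>();
        for (int i = 0; i < n; i++) {
            array.add(i);
            linked.add(i);
        }

        long t0 = System.nanoTime();
        sumByIndex(array);
        long t1 = System.nanoTime();
        sumByIndex(linked);
        long t2 = System.nanoTime();

        System.out.println("ArrayList  indexed sum: " + (t1 - t0) / 1_000_000 + " ms");
        System.out.println("LinkedList indexed sum: " + (t2 - t1) / 1_000_000 + " ms");
    }
}
```

Even iterating LinkedList with an Iterator (which avoids the O(n) get) still chases one pointer per element, so the cache numbers above work against it.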


System event                                        | Actual latency | Scaled latency
One CPU cycle                                       | 0.4 ns         | 1 s
Level 1 cache access (8 KB to 128 KB) - 128 KB here | 0.9 ns         | 2 s
Level 2 cache access (256 KB to 8 MB) - 512 KB here | 2.8 ns         | 7 s
Level 3 cache access (4 MB to 50 MB) - 3 MB here    | 28 ns          | 1 min
Main memory access (DDR DIMM)                       | ~100 ns        | 4 min
Intel Optane memory access                          | <10 μs         | 7 hrs
NVMe SSD I/O                                        | ~25 μs         | 17 hrs
SSD I/O                                             | 50–150 μs      | 1.5–4 days
Rotational disk I/O                                 | 1–10 ms        | 1–9 months
Internet call: San Francisco to New York City       | 65 ms          | 5 years
Internet call: San Francisco to Hong Kong           | 141 ms         | 11 years
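The "scaled" column is just every actual latency multiplied by the same factor: the one that stretches a 0.4 ns CPU cycle into 1 second, i.e. ×2.5 billion. A quick sketch of the arithmetic (class and method names are mine):

```java
public class ScaledLatency {
    // Factor that stretches one 0.4 ns CPU cycle to 1 second: 1 / 0.4e-9 = 2.5e9.
    static final double SCALE = 1.0 / 0.4e-9;

    // Convert an actual latency in nanoseconds to its scaled value in seconds.
    static double scaledSeconds(double actualNanos) {
        return actualNanos * 1e-9 * SCALE;
    }

    public static void main(String[] args) {
        // Main memory: 100 ns scales to 250 s, i.e. roughly 4 minutes.
        System.out.printf("main memory (100 ns): %.0f s%n", scaledSeconds(100));
        // SF to NYC round trip: 65 ms scales to roughly 5 years.
        System.out.printf("SF -> NYC (65 ms): %.1f years%n",
                scaledSeconds(65_000_000) / (365.0 * 24 * 3600));
    }
}
```

In other words, if one CPU cycle took a full second, a round trip to New York would take five years of wall-clock time.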

(don't worry that these numbers don't exactly match the table above - different eras, different architectures, etc.)

Operation                           | Latency (ns)   | Latency (readable)
execute typical instruction         | 1 ns           | 1 ns
fetch from L1 cache memory          | 0.5 ns         | 0.5 ns
branch misprediction                | 5 ns           | 5 ns
fetch from L2 cache memory          | 7 ns           | 7 ns
mutex lock/unlock                   | 25 ns          | 25 ns
fetch from main memory              | 100 ns         | 100 ns
send 2K bytes over 1 Gbps network   | 20,000 ns      | 20 μs
read 1 MB sequentially from memory  | 250,000 ns     | 250 μs
fetch from new disk location (seek) | 8,000,000 ns   | 8 ms
read 1 MB sequentially from disk    | 20,000,000 ns  | 20 ms
send packet US to Europe and back   | 150,000,000 ns | 150 ms
