Chinese Arm takes integer performance crown?

The law doesn’t work as well for a PC network where the longest sequential thread can be distributed to all available threads.

Then it is no longer the highest pole in the tent. Not a complaint, I used to do a lot of lowering tentpoles. Even though at the time I was most concerned with reducing system loads on our local Multics server (BCO). Many times cutting down a particular thread ended up reducing the total CPU time in addition to finishing faster.

I think we are about to see AI workloads dealt with by adding hardware instructions. There have been a lot of x86/x64 SIMD instructions added to deal with matrix multiplication. Right now neural net instructions are starting to show up. The alternative is to use FPGAs, but since the neural net processing tends to be a client application, I think that both AMD and Intel see adding that to desktop, laptop, and smartphones.