Chinese Arm takes integer performance crown?

eachus · July 18, 2022, 1:07am

The law doesn’t apply as well to a PC network, where the longest sequential thread can be distributed across all available threads.

Then it is no longer the tallest pole in the tent. Not a complaint; I used to do a lot of tentpole lowering, even though at the time I was most concerned with reducing system load on our local Multics server (BCO). Many times, cutting down a particular thread ended up reducing the total CPU time in addition to finishing faster.
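
To make the tentpole idea concrete, here is a rough Python sketch. It assumes "the law" above means Amdahl's law, and the function names (amdahl_speedup, wall_time, split_longest) and the thread times are made up purely for illustration. It models wall-clock time only, so it does not capture the reduction in total CPU time mentioned above.

```python
def amdahl_speedup(serial_fraction, workers):
    """Ideal speedup under Amdahl's law (assumed to be the law in question)."""
    return 1.0 / (serial_fraction + (1.0 - serial_fraction) / workers)


def wall_time(thread_times):
    """Threads running in parallel finish when the longest one does."""
    return max(thread_times)


def split_longest(thread_times, pieces):
    """Hypothetical helper: cut the longest thread into equal pieces.

    Assumes the work divides perfectly with no overhead, which real
    workloads rarely allow.
    """
    times = sorted(thread_times)
    longest = times.pop()  # the current tentpole
    return times + [longest / pieces] * pieces


# Illustrative numbers only: one long thread and two short ones.
before = [10.0, 3.0, 2.0]
after = split_longest(before, 4)

print(wall_time(before))  # the 10.0 thread is the tentpole
print(wall_time(after))   # a different thread is now the tallest pole
print(amdahl_speedup(0.5, 4))
```

Once the longest thread is cut down, the next-longest one sets the finish time, so the gain from further splitting the first thread runs out quickly.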

I think we are about to see AI workloads dealt with by adding hardware instructions. A lot of x86/x64 SIMD instructions have been added to deal with matrix multiplication, and neural net instructions are now starting to show up. The alternative is to use FPGAs, but since neural net processing tends to be a client application, I think both AMD and Intel envision adding that capability to desktops, laptops, and smartphones.
