How AI will change computers

In my recent post about Neural Processors, I noted that just about everyone who can – Google, Apple, IBM, Intel & more – has built neural processors to accelerate neural networks. They are mostly deployed as co-processors to run AI models, but as demand for intelligent applications grows, systems will have to do more to adapt.

Why? Because AI systems have unique I/O requirements. That’s why the neural processors don’t have caches, nor do they need floating point numbers – and their I/O overhead.

In the joint application of computer vision, moving many frames of the high-res video also stresses the I/O subsystem. Recurrent neural networks focus on streaming data, another bandwidth intensive application.

How AI will change computers 1

We’re able to limp along today because this is AI’s early days – much as 8-bit processors in 70s PCs worked fine – and it’s only as capacity and performance requirements grow that system architectures will have to change.

Architects and computer scientists are still learning how to optimize data structures and data representation for performance. Even so, it is painfully clear that standard x86 architectures will never become preferred AI platforms.

As a result, unlike memory busses today, a DNN is most efficient when DRAM bandwidth is evenly partitioned across the DRAM ports. Which means that other logic, designed for full memory busses, such as multiplexers, is not required either.

Since DRAM can account for as much as 90 percent of energy consumption, minimizing memory logic, and using memory efficiently, can be a significant cost saving for mobile devices – or a warehouse-scale computer.

Memory accesses aren’t the only, or even always the most important, a difference between traditional and AI workloads. But there is no doubt that as AI applications grow in sophistication, current architectures – x86 and ARM – will be less and less relevant.
The Storage Bits take

In the next post, I’ll discuss further the implications that the widespread use of AI applications will have on CPU and server architectures. Suffice it to say that if AI applications become widespread – and I believe they will – a new generation of CPUs will be required to run them efficiently and quickly.

Johnny J. Hernandez
Zombie aficionado. Beer practitioner. Coffee geek. Total alcohol maven. Freelance reader. Spent the better part of the 90's creating marketing channels for trumpets in Jacksonville, FL. Spent a weekend working on chess sets in Mexico. Spent a weekend creating marketing channels for Magic 8-Balls in Hanford, CA. Spoke at an international conference about developing inflatable dolls in Las Vegas, NV. Had some great experience importing muffins in the UK. Had a brief career getting my feet wet with crayon art in Pensacola, FL.