IBM Research in San Jose, California Develops New Neural Computer Architecture – NorthPole

Dr. Dharmendra S. Modha, Ph.D., graduate of UCSD Engineering, and his team, developed a computer chip with a neural inspired architecture, called NorthPole, where memory and computation are integrated, that achieves substantially higher performance, energy efficiency, and area efficiency compared with other architectures. This computer obviates the need to access external memory, thus greatly increasing efficiency.

A brain-inspired computer chip architecture that may enhance artificial intelligence (AI) by working faster with much less power has been developed by researchers at IBM in San Jose, California. Their massive NorthPole processor chip eliminates the need to frequently access external memory, and so performs tasks such as image recognition faster than existing architectures doing so while consuming vastly less power. You’ve heard about the massive amount of energy consumption by AI computer systems and volumes of water needed to cool the AI computer banks – this architecture will help.

“Its energy efficiency is just mind-blowing,” says Damien Querlioz, a nanoelectronics researcher at the University of Paris-Saclay in Palaiseau. The work, published in Science1, shows that computing and memory can be integrated on a large scale, he says. “I feel the paper will shake the common thinking in computer architecture.”

NorthPole runs neural networks: multi-layered arrays of simple computational units programmed to recognize patterns in data. A bottom layer takes in data, such as the pixels in an image; each successive layer detects patterns of increasing complexity and passes information on to the next layer. The top layer produces an output that, for example, can express how likely an image is to contain a cat, a car or other objects.

Some computer chips can handle these calculations efficiently, but they still need to use external memory called RAM each time they calculate a layer. Shuttling data between chips in this way slows things down — a phenomenon known as the Von Neumann bottleneck, after mathematician John von Neumann, who first conceived the standard architecture of computers based on a processing unit and a separate memory unit.

A key feature of this chip is the understanding that for almost all computational tasks, access to memory plays as important a role as logic processing. Unlike analog in-memory computing, this purely digital system has the option of tailoring the bit precision as needed, which allows for optimization of the power usage.

The Von Neumann bottleneck is one of the most significant factors that slow computer applications — including AI. It also results in energy inefficiencies. Study co-author Dharmendra Modha, a computer engineer at IBM, says he once estimated that simulating a human brain on this type of architecture might require the equivalent of the output of 12 nuclear reactors. BTW, the human brain is estimated to consists of 100 billion neurons and over 100 trillion synaptic connections – it’s still the best computer in the universe.

NorthPole is made of 256 computing units, or cores, each of which contains its own memory. “You’re mitigating the Von Neumann bottleneck within a core,” says Modha, who is IBM’s chief scientist for brain-inspired computing at the company’s Almaden research centre in San Jose.

The cores are wired together in a network inspired by the white-matter connections between parts of the human cerebral cortex, Modha says. This and other design principles — most of which existed before but had never been combined in one chip — enable NorthPole to beat existing AI machines by a substantial margin in standard benchmark tests of image recognition. It also uses one-fifth of the energy of state-of-the-art AI chips, despite not using the most recent and most miniaturized manufacturing processes. If the NorthPole design were implemented with the most up-to-date manufacturing process, its efficiency would be 25 times better than that of current designs, the authors estimate.

But Problems Remain to be Solved

But even NorthPole’s 224 megabytes of RAM are not enough for large language models, such as those used by the chatbot ChatGPT, which take up several thousand megabytes of data even in their most stripped-down versions. And the chip can run only pre-programmed neural networks that need to be ‘trained’ in advance on a separate machine. But the paper’s authors say that the NorthPole architecture could be useful in speed-critical applications, such as self-driving cars.

NorthPole brings memory units as physically close as possible to the computing elements in the core. Elsewhere, researchers have been developing more-radical innovations using new materials and manufacturing processes. These enable the memory units themselves to perform calculations, which in principle could boost both speed and efficiency even further.

Another chip, described last month2, does in-memory calculations using memristors, circuit elements able to switch between being a resistor and a conductor. “Both approaches, IBM’s and ours, hold promise in mitigating latency and reducing the energy costs associated with data transfers,” says Bin Gao at Tsinghua University, Beijing, who co-authored the memristor study.

Another approach, developed by several teams — including one at a separate IBM lab in Zurich, Switzerland3 — stores information by changing a circuit element’s crystal structure. It remains to be seen whether these newer approaches can be scaled up economically.

doi: https://doi.org/10.1038/d41586-023-03267-0

References

  1. Modha, D. S. et al. Science 382, 329–335 (2023).Article Google Scholar 
  2. Zhang, W. et al. Science 381, 1205–1211 (2023).Article PubMed Google Scholar 
  3. Le Gallo, M. et al. Nature Electron. 6, 680–693 (2023).Article Google Scholar 

Published by Dr. Greg Maguire, Ph.D.

Dr. Maguire, a Fulbright-Fogarty Fellow at the National Institutes of Health, is a scientist, innovator, teacher, healthcare professional. He has over 100 publications and numerous patents. His book, "Adult Stem Cell Released Molecules: A Paradigm Shift To Systems Therapeutics" was published by Nova Science Publishers in 2018.

Leave a comment