Stripes: Bit-Serial Deep Neural Network Computing

March 08, 2018

Authors: Patrick Judd, Jorge Albericio, Tayler Hetherington, Tor M. Aamodt, Andreas Moshovos
Venue: MICRO 2016

Stripes presents an architecture whose execution time scales almost linearly with the bit precision used for neural network data. The authors perform a design space exploration to find the minimum number of bits each network needs to stay within 1% of the original network's accuracy. In terms of hardware, the paper takes a DaDianNao-like approach, but thanks to the per-layer precision optimizations it consumes less energy and runs faster.
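To make the "scales with precision" idea concrete, here is a minimal sketch of a bit-serial dot product. It is an illustration under my own assumptions, not the paper's datapath: activations are unsigned integers fed one bit per cycle (least significant bit first), weights are used at full width, and the function name `bit_serial_dot` and its cycle count are hypothetical.

```python
def bit_serial_dot(activations, weights, precision):
    """Dot product that consumes one activation bit per "cycle".

    Assumes each activation is an unsigned integer that fits in
    `precision` bits. Returns (result, cycles), where cycles equals
    the precision, so fewer bits means fewer cycles.
    """
    total = 0
    for bit in range(precision):  # one cycle per activation bit, LSB first
        partial = 0
        for a, w in zip(activations, weights):
            if (a >> bit) & 1:    # a 1-bit "multiply" is just a gated add
                partial += w
        total += partial << bit   # weight the partial sum by its bit position
    return total, precision


acts = [3, 5, 2, 7]   # all fit in 3 bits
wts = [1, -2, 4, 3]

result, cycles = bit_serial_dot(acts, wts, precision=3)
print(result, cycles)  # 22 3
```

With 3-bit activations the loop runs 3 times instead of the 16 a fixed 16-bit pass would need, which is the intuition behind choosing the smallest precision each layer can tolerate.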
