Approximating Deep Neural Networks via Custom Arithmetic: a Tradeoff Between Accuracy and Power Consumption

People

(Responsible)

Abstract

Neural networks (NNs) have become a ubiquitous and essential element of modern computing, driving advancements in fields ranging from computer vision and natural language processing to autonomous vehicles and medical diagnostics. Despite their widespread and undisputed utility, however, NNs are inherently extremely resource-intensive, and training large models, especially for deep neural networks (DNNs), requires enormous computational power, with recent studies suggesting that training large-scale models is contributing to significant environmental impact, and that training a single state-of-the-art language model can emit as much carbon dioxide as five cars over their entire lifetimes~\cite{StrubellJan19, StrubellFeb20}. This project aims at exploring the efficient approximation of DNNs, specifically though the use of custom arithmetic components. Exploiting the knowhow and experience gained by my research in the field of Approximate Computing, and in particular in the design of Custom Arithmetic Components, this project will explore novel approximation avenues for deploying neural networks at scale that carefully balance the tradeoff between a limited loss in accuracy for a substantial energy saving.

Additional information

Start date

01.05.2025

End date

31.10.2025

Duration

7 Months

Funding sources

Hasler Foundation

Status

Ended

People

Education

Research

Organisation

Approximating Deep Neural Networks via Custom Arithmetic: a Tradeoff Between Accuracy and Power Consumption

People

Abstract

Additional information

Faculties

Organizational units

Maps and directions

Stay in touch