VINCENZO LIGUORI, owner, Ocean Logic
Energy efficiency is becoming a critical issue for AI/ML processors. This concern extends not only to IoT devices but also to large data centers, where even nuclear power is being considered to meet demand. The issue is closely tied to computational efficiency and flexibility.
I expect AI/ML efficiency to be a major trend in 2025. At Ocean Logic, we have been focusing on addressing these often-conflicting goals. Compression, especially after quantization, can help reduce the bandwidth and large storage required by model weights, thereby reducing power consumption.
Our approach involves simultaneously compressing the weights and supporting a variety of user-defined floating point (fp), posit, and integer representations. The implementation is straightforward and allows direct hardware support for well-known formats such as INT8, BFLOAT16, and FP8, as well as for integers of different sizes and fp numbers with user-defined exponent and mantissa widths.
For example, the BFLOAT16 weights of LLama 2 7B can be losslessly compressed by approximately 1.5 times, outperforming both GZIP and BZIP2 with significantly fewer resources and without requiring a large buffer memory for decompression. Weight compression also works well after quantization: the same LLama 2 7B, after being quantized to 7 bits (a lossy process), can then be losslessly compressed to around 3.4 bits per weight.
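The compressor itself is not described here, so the short Python sketch below only illustrates the general principle behind these numbers: once weights have been quantized, their codes are far from uniformly distributed, so a lossless entropy coder can store them in fewer bits than their nominal width. The toy Gaussian tensor, the symmetric 7-bit quantizer, and the use of zlib are illustrative assumptions, not Ocean Logic's scheme, which additionally targets streaming decompression without a large buffer.

```python
import random
import zlib

# Toy stand-in for a weight tensor; real model weights are even less uniform,
# which is what makes post-quantization lossless compression effective.
random.seed(0)
weights = [random.gauss(0.0, 0.02) for _ in range(100_000)]

# 1) Lossy step: quantize to 7-bit signed integers with a symmetric per-tensor scale.
scale = max(abs(w) for w in weights) / 63.0
codes = [max(-64, min(63, round(w / scale))) for w in weights]

# 2) Lossless step: entropy-code the quantized values. zlib is only a stand-in for
#    a hardware-friendly coder; it still stores the 7-bit codes in fewer than 7 bits.
packed = bytes(c + 64 for c in codes)            # one code per byte, offset to 0..127
compressed = zlib.compress(packed, level=9)

print(f"{8 * len(compressed) / len(codes):.2f} bits per weight after lossless compression")
```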
Furthermore, the AI/ML field is far from settled, with new models continuously emerging, presenting another challenge. Low-resource fp hardware can alleviate this uncertainty by supporting more models more easily.
Exponent Indexed Accumulators (ExIA) are an extremely simple architecture for adding long sequences of fp numbers. They operate in two stages: an accumulation stage, where partial results are accumulated, and a reconstruction stage, where the result is finalized. The ExIA result is exact and potentially hundreds of bits long.
ExIA also do not require normalized fp numbers as input, allowing them to accept integers and fixed-point numbers without conversion. They can be easily fused with a multiplier, providing an efficient MAC.
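The ExIA micro-architecture is not detailed here, so the Python sketch below is only a software illustration of the general idea as described: during accumulation, each incoming number's significand is added, as a plain integer, into a small accumulator selected by its exponent; during reconstruction, the per-exponent partial sums are combined into a single exact, very wide integer. The BFLOAT16 field decoding, the function names, and the round-toward-zero conversion are illustrative assumptions, not Ocean Logic's implementation.

```python
import math
import struct

def bf16_fields(x):
    """Decode a Python float into BFLOAT16-style (sign, exponent, mantissa) fields
    by truncating a float32 to its top 16 bits (round-toward-zero; fine for a sketch)."""
    bits = struct.unpack(">I", struct.pack(">f", x))[0] >> 16
    return (bits >> 15) & 1, (bits >> 7) & 0xFF, bits & 0x7F

def exia_accumulate(values, num_exponents=256):
    """Accumulation stage: one small signed-integer accumulator per exponent value.
    Each fp addition becomes a plain integer add into the bucket its exponent selects.
    (Inf/NaN handling is omitted to keep the sketch short.)"""
    buckets = [0] * num_exponents
    for x in values:
        sign, exp, man = bf16_fields(x)
        if exp == 0:                 # subnormal: no implicit leading 1, exponent acts as 1
            significand, exp = man, 1
        else:                        # normal: restore the implicit leading 1
            significand = man | 0x80
        buckets[exp] += -significand if sign else significand
    return buckets

def exia_reconstruct(buckets):
    """Reconstruction stage: fold the per-exponent partial sums into one exact wide
    integer; the represented value is this integer times 2**(1 - bias - mant_bits)."""
    total = 0
    for exp, partial in enumerate(buckets):
        if partial:
            total += partial << (exp - 1)   # exponent 1 carries the smallest scale
    return total

# Usage: sum a long sequence exactly and check against math.fsum of the same
# BFLOAT16-rounded inputs.
def to_bf16(x):
    bits = struct.unpack(">I", struct.pack(">f", x))[0] & 0xFFFF0000
    return struct.unpack(">f", struct.pack(">I", bits))[0]

data = [0.1, 1e8, -1e8, 0.25] * 1000
wide = exia_reconstruct(exia_accumulate(data))
print(wide * 2.0 ** (1 - 127 - 7))               # exact sum, rescaled for display
print(math.fsum(to_bf16(x) for x in data))       # reference: same value
```

The point of the sketch is simply that the running state is a set of narrow integer accumulators and that the final sum can be recovered exactly, with no rounding at any step.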
In FPGAs, ExIA are extremely compact: a BFLOAT16 MAC, capable of an addition and a multiplication every clock cycle, occupies less than 100 LUTs and 1 DSP, producing an exact result more than 256 bits long. ExIA also have the advantage that, with each fp addition, the amount of logic switching is similar to that of an integer accumulator, with clear power implications.
These two technologies should be able to make a meaningful impact on the design of AI/ML processors. I’m looking forward to an interesting 2025.