Mipsology Zebra on Xilinx FPGA Beats GPUs, ASICs for ML Inference Efficiency

Machine learning software innovator Mipsology today announced that its Zebra AI inference accelerator achieved the highest efficiency based on the latest MLPerf inference benchmarking. Zebra on a Xilinx Alveo U250 accelerator card achieved more than 2x higher peak performance efficiency compared to all other commercial accelerators.

“We are very proud that our architecture proved to be the most efficient for computing neural networks out of all the existing solutions tested, and in ML Perf’s ‘closed’ category which has the highest requirements,” said Ludovic Larzul, CEO and founder, Mipsology. “We beat behemoths like NVIDIA, Google, AWS, and Alibaba, and extremely well-funded startups like Groq, without having to design a specific chip and by tapping the power of FPGA reprogrammable logic. Perhaps the industry needs to stop over-relying on only increasing peak TOPS. What is the point of huge, expensive silicon with 400+ TOPS if nobody can use the majority of it?”

Peak TOPS have for years been the standard for measuring computation performance potential, so many assume that more TOPS equal higher performance. However, this fails to take into consideration the real efficiency of the architecture, and the fact that at some point there are diminishing returns. This phenomenon, similar to “dark silicon” for power, occurs when the circuitry can simply not be used because of existing limitations. Zebra has proven to scale along with TOPS, maintaining the same high efficiency while peak TOPS are growing.

With a peak TOPS of 38.3 announced by Xilinx, the Zebra-powered Alveo U250 accelerator card significantly outperformed competitors in terms of throughput per TOPS and ranks among the best accelerators available today. It delivers performance similar to an NVIDIA T4, based on the MLPerf v0.7 inference results, while it has 3.5x less TOPS. In other words, Zebra on the same number of TOPS as a GPU would deliver 3.5x more throughput or 6.5x higher than a TPU v3. This performance does not come at the cost of changing the neural network. Zebra was accepted in the demanding closed category of MLPerf, requiring no neural network changes, high accuracy, and no pruning or other methods requiring user intervention. Zebra achieves this efficiency all while maintaining TensorFlow and Pytorch framework programmability.

“Mipsology’s Zebra AI inference acceleration on the Xilinx Alveo platform gives developers a solid differentiator in neural network inference computation,” said Ramine Roane, vice president of AI & Software at Xilinx. ” Zebra empowers our adaptive Alveo platforms with more compute efficiency than any other products, including GPUs.”

MLPerf has been the industry benchmark for comparing the training performance of ML hardware, software and services since 2018, and inference performance since 2019.

Shannon Davis

Shannon, writes, edits and produces Semiconductor Digest’s news articles, email newsletters, blogs, webcasts, and social media posts. She holds a bachelor’s degree in journalism from Huntington University in Huntington, IN. In addition to her years of freelance business reporting, Shannon has also worked in marketing and public relations in the renewable energy and healthcare industries.

Cookie	Duration	Description
cookielawinfo-checkbox-advertisement	1 year	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Advertisement".
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Cookie	Duration	Description
_ga	2 years	This cookie is installed by Google Analytics. The cookie is used to calculate visitor, session, campaign data and keep track of site usage for the site's analytics report. The cookies store information anonymously and assign a randomly generated number to identify unique visitors.
_gat_gtag_UA_142332005_1	1 minute	This cookie is set by Google and is used to distinguish users.
_gid	1 day	This cookie is installed by Google Analytics. The cookie is used to store information of how visitors use a website and helps in creating an analytics report of how the website is doing. The data collected including the number visitors, the source where they have come from, and the pages visted in an anonymous form.
_pk_id.56353.85f6	1 year 27 days	This cookie is set by Google Analytics and is used to store a unique user ID for statistical purposes.
_pk_ses.56353.85f6	30 minutes	This cookie is created by Piwik PRO to store a unique session ID.
CONSENT	16 years 5 months 18 days 3 hours	These cookies are set via embedded youtube-videos. They register anonymous statistical data on for example how many times the video is displayed and what settings are used for playback.No sensitive data is collected unless you log in to your google account, in that case your choices are linked with your account, for example if you click “like” on a video.

Cookie	Duration	Description
IDE	1 year 24 days	Used by Google DoubleClick (which we don't use) and stores information about how the user uses the website and any other advertisement before visiting the website. This is used to present users with ads that are relevant to them according to the user profile.
test_cookie	15 minutes	This cookie is set by doubleclick.net. The purpose of the cookie is to determine if the user's browser supports cookies.
VISITOR_INFO1_LIVE	5 months 27 days	This cookie is set by Youtube. Used to track the information of the embedded YouTube videos on a website.
YSC	session	This cookies is set by Youtube and is used to track the views of embedded videos.

Cookie	Duration	Description
optin	1 hour	This cookie tracks users who take an affirmative action, such as checking a tick-box or another similar action. The consent is used for a variety of purposes, such as agreeing to terms and conditions, signing up for online content like newsletters and resources, consenting to the use of cookies, and more.
yt-remote-connected-devices	never	Stores the user's video player preferences using embedded YouTube video.
yt-remote-device-id	never	Stores the user's video player preferences using embedded YouTube video.

Mipsology Zebra on Xilinx FPGA Beats GPUs, ASICs for ML Inference Efficiency

Shannon Davis

Featured Products

BrainChip Demonstrates How Its Akida Technology Is Delivering the Next-Generation of AI at the Edge at First-Ever AI Field Day

Professors from Stanford, UC Berkeley to be Honored for Excellence in Semiconductor Research