Cerebras breaks floor on Condor Galaxy 3, an AI supercomputer that may hit 8 exaFLOPs


Are you trying to showcase your model in entrance of the gaming {industry}’s high leaders? Study extra about GamesBeat Summit sponsorship alternatives right here. 

Cerebras and G42 mentioned they’ve damaged floor on Condor Galaxy 3, an AI supercomputer that may hit eight exaFLOPs of efficiency.

That’s loads of efficiency which shall be delivered over 58 million AI-optimized cores, mentioned Andrew Feldman, CEO of Sunnyvale, California-based Cerebras, in an interview with VentureBeat. And it’s going to G42, a national-scale cloud and generative AI enabler based mostly in Abu Dhabi within the United Arab Emirates. It’s going to be one of many world’s largest AI supercomputers, Feldman mentioned.

That includes 64 of Cerebras’ newly introduced CS-3 techniques – all powered by what Feldman says is the {industry}’s quickest AI chip, the Wafer-Scale Engine 3 (WSE-3) — Condor Galaxy will ship 8 exaFLOPs of AI with 58 million AI-optimized cores.

“We constructed large, quick AI supercomputers. We started constructing clusters and the clusters received larger, after which the cluster received larger nonetheless,” Feldman mentioned. “After which we started coaching big fashions on them.”

GB Occasion

GamesBeat Summit Name for Audio system

We’re thrilled to open our name for audio system to our flagship occasion, GamesBeat Summit 2024 hosted in Los Angeles, the place we’ll discover the theme of “Resilience and Adaption”.

Apply to talk right here

So far as “chip” goes, Cerebras has a reasonably distinctive strategy. The corporate designs its cores to be small, however they’re unfold throughout a whole semiconductor wafer — usually used for lots of of chips. Through the use of the identical substrate for its chips, it quickens communication and makes processing extra environment friendly. That’s the way it can match 900,000 cores on a single chip, or somewhat a fairly large wafer.

Positioned in Dallas, Texas, Condor Galaxy 3 is the third set up of the Condor Galaxy community of AI supercomputers. The Cerebras and G42 strategic partnership already delivered 8 exaFLOPs of AI supercomputing efficiency through Condor Galaxy 1 and Condor Galaxy 2, every amongst the most important AI supercomputers on this planet.

Condor Galaxy 3 brings the present complete of the Condor Galaxy community to 16 exaFLOPs. By the top of 2024, Condor Galaxy will ship greater than 55 exaFLOPs of AI compute. General, Cerebras will construct 9 AI supercomputers for G42.

Cerebras CS-3 up shut.

“With Condor Galaxy 3, we proceed to attain our joint imaginative and prescient of remodeling the worldwide stock of AI compute by means of the event of the world’s largest and quickest AI supercomputers,” mentioned Kiril Evtimov, Group CTO of G42, in a press release. “The prevailing Condor Galaxy community has educated a number of the main open-source fashions within the {industry}, with hundreds of thousands of downloads, and we look ahead to seeing the following wave of innovation Condor Galaxy supercomputers can allow with twice the efficiency.”

On the coronary heart of the 64 Cerebras CS-3 techniques comprising Condor Galaxy 3, the brand new WSE-3 5 nanometer chip delivers twice the efficiency on the similar energy and price. Objective constructed for coaching the {industry}’s largest AI fashions, the 4 trillion transistor WSE-3 delivers an astounding 125 petaflops of peak AI efficiency with 900,000 AI-optimized cores per chip.

“We’re honored that our newly introduced CS-3 techniques will play a essential position in our pioneering strategic partnership with G42,” mentioned Feldman. “Condor Galaxy 3 by means of Condor Galaxy 9 will every use 64 of the brand new CS-3s, increasing the quantity of compute we’ll ship from 36 exaFLOPs to greater than 55 exaFLOPs. This marks a major milestone in AI computing, offering unparalleled processing energy and effectivity.”

Condor Galaxy has educated generative AI fashions, together with Jais-30B, Med42, Crystal-Coder-7B and BTLM-3B-8K. Jais 13B and Jais30B are one of the best bilingual Arabic fashions on this planet, now out there on Azure Cloud. BTLM-3B-8K is the primary main 3B mannequin on HuggingFace, providing 7B parameter efficiency in a light-weight 3B parameter mannequin for inference, the corporate mentioned.

Med42, developed with M42 and Core42, is a number one medical LLM, educated on Condor Galaxy 1 in a weekend and surpassing MedPaLM on efficiency and accuracy.

Condor Galaxy 3 shall be out there in Q2 2024.

Wafer Scale Engine 3

Cerebras Condor Galaxy at Colovore Knowledge Heart

In different information, Cerebras talked in regards to the chip that powers the supercomputer. It mentioned it has doubled down on its current world report of quickest AI chip with the introduction of the Wafer Scale Engine 3.

The WSE-3 delivers twice the efficiency of the earlier record-holder, the Cerebras WSE-2, on the similar energy draw and and for a similar worth. Objective constructed for coaching the {industry}’s largest AI fashions, the 5nm-based, 4 trillion transistor WSE-3 powers the Cerebras CS-3 AI supercomputer, delivering 125 petaflops of peak AI efficiency by means of 900,000 AI optimized compute cores.

Feldman mentioned the pc shall be delivered on 150 pallets.

“We’re saying our five-nanometer half for our present era wafer scale engine. That is the quickest chip on Earth. It’s a 46,000-square-millimeter half manufactured at TSMC. Within the 5 nanometer node it’s 4 trillion transistors, 900,000 ai cores and 125 petaflops of AI compute,” he mentioned.

With an enormous reminiscence system of as much as 1.2 petabytes, the CS-3 is designed to coach subsequent era frontier fashions 10x bigger than GPT-4 and Gemini. 24 trillion parameter fashions will be saved in a single logical reminiscence area with out partitioning or refactoring, dramatically simplifying coaching workflow and accellerating developer productiveness. Coaching a one-trillion parameter mannequin on the CS-3 is as simple as coaching a one billion parameter mannequin on GPUs.

The CS-3 is constructed for each enterprise and hyperscale wants. Compact 4 system configurations can nice tune 70B fashions in a day whereas at full scale utilizing 2048 techniques, Llama 70B will be educated from scratch in a single day – an unprecedented feat for generative AI.

The most recent Cerebras Software program Framework offers native help for PyTorch 2.0 and the newest AI fashions and strategies comparable to multi-modal fashions, imaginative and prescient transformers, combination of specialists, and diffusion. Cerebras stays the one platform that gives native {hardware} acceleration for dynamic and unstructured sparsity, dashing up coaching by as much as eight instances.

“After we began on this journey eight years in the past, everybody mentioned wafer-scale processors had been a pipe dream. We couldn’t be extra proud to be introducing the third-generation of our groundbreaking water scale AI chip,” mentioned Feldman. “WSE-3 is the quickest AI chip on this planet, purpose-built for the newest cutting-edge AI work, from combination of specialists to 24 trillion parameter fashions. We’re thrilled for carry WSE-3 and CS-3 to market to assist remedy at this time’s largest AI challenges.”

With each element optimized for AI work, CS-3 delivers extra compute efficiency at much less area and fewer energy than every other system. Whereas GPUs energy consumption is doubling era to era, the CS-3 doubles efficiency however stays inside the similar energy envelope. The CS-3 presents superior ease of use, requiring 97% much less code than GPUs for LLMs and the power to coach fashions starting from 1B to 24T parameters in purely knowledge parallel mode. A normal implementation of a GPT-3 sized mannequin required simply 565 strains of code on Cerebras – an {industry} report.

“We help fashions as much as 24 trillion parameters,” Feldman mentioned.

Trade partnerships and buyer momentum

Cerebras already has a sizeable backlog of orders for CS-3 throughout enterprise, authorities and worldwide clouds.

“We’ve been an early buyer of Cerebras options from the very starting, and we’ve been capable of quickly speed up our scientific and medical AI analysis, because of the 100x-300x efficiency enhancements delivered by Cerebras wafer-scale expertise,” mentioned Rick Stevens, Argonne Nationwide Laboratory Affiliate Laboratory Director for Computing, Setting and Life Sciences, in a press release. “We look ahead to seeing what breakthroughs CS-3 will allow with double the efficiency inside the similar energy envelope.”

Qualcomm deal

Cerebras WSE-3

This week, Cerebras additionally introduced a brand new technical and GTM collaboration with Qualcomm to ship 10 instances the efficiency in AI inference by means of the advantages of Cerebras’ inference-aware coaching on CS-3.

“Our expertise collaboration with Cerebras permits us to supply clients the highest-performance AI coaching answer mixed with one of the best perf/TCO$ inference answer. As well as, clients can obtain totally optimized deployment prepared fashions thereby radically decreasing time to ROI as effectively,” mentioned Rashid Attar, VP of cloud computing at Qualcomm, in a press release.

Through the use of Cerebras’ industry-leading CS-3 AI accelerators for coaching and the Qualcomm Cloud AI 100 Extremely for inference, production-grade deployments can notice a driving 10 instances price-performance enchancment.

“We’re saying a worldwide partnership with Qualcomm to coach fashions which might be optimized for his or her inference engine. And so this partnership permits us to make use of a set of strategies which might be distinctive to us and a few which might be out there extra broadly to radically cut back the price of inference,” Feldman mentioned. “So, it is a partnership during which we shall be coaching fashions such that they’ll speed up inference on a number of completely different methods.”

Cerebras has greater than 400 engineers. “It’s arduous to do to ship big quantities of compute on schedule. And I don’t suppose there’s every other participant within the class. Some other startup who’s delivered the quantity of compute we’ve over the previous six months. And that along with Qualcomm, we’re driving the price of inference down,” Feldman mentioned.

VentureBeat’s mission is to be a digital city sq. for technical decision-makers to achieve information about transformative enterprise expertise and transact. Uncover our Briefings.


Supply hyperlink

Apple Magic Trackpad: Wi-fi, Bluetooth, Rechargeable. Works with Mac or iPad; Multi-Contact Floor – White

Google Clarifies Web page Expertise & Core Internet Vitals Doc