
Amazon is developing a stronger AI chip: one that rivals Nvidia's GPUs for half the price


On Friday afternoon, inside Amazon's chip lab in Austin, Texas, six engineers tested a new server design that was kept under strict secrecy.

Amazon executive Rami Sinno said during a tour of the lab on Friday that the server houses Amazon's artificial intelligence chips, which are designed to compete with those of market leader Nvidia.

Amazon is developing its own processors to limit its reliance on Nvidia's expensive chips (the so-called Nvidia tax), which power part of the AI cloud business at Amazon Web Services, the company's main growth driver.

Amazon hopes to help customers perform complex calculations and process large amounts of data at a lower cost with its self-developed chips. Its rivals Microsoft and Alphabet are doing the same.

Sinno, director of engineering at Annapurna Labs, part of Amazon's cloud business, said Amazon's customers are increasingly demanding cheaper alternatives to Nvidia.

Amazon acquired Annapurna Labs in 2015.

While Amazon's AI chip research and development is just getting started, its main chip, Graviton, which performs non-AI computing, has been in development for nearly a decade and is now in its fourth generation. AI chips Trainium and Inferentia are newer designs.

"So, in some cases, price-performance can improve by 40 to 50 percent, meaning the cost should be half that of running the same model on Nvidia hardware," said David Brown, vice president of compute and networking at AWS, on Tuesday.

AWS sales, which account for nearly one-fifth of Amazon's overall revenue, soared 17% to $25 billion in the first quarter of this year compared to the same period last year. AWS controls about one-third of the cloud computing market, while Microsoft's Azure has about 25 percent.

Amazon said that during the recent Prime Day, the company deployed 250,000 Graviton chips and 80,000 custom AI chips to cope with the surge in activity on its platform.

Breaking out of Nvidia's chip encirclement

Amazon has been developing its own AI chips to reduce costs, which has also helped improve the profitability of Amazon Web Services (AWS). However, the e-commerce giant is struggling to develop AI chips that can rival Nvidia's industry-standard offerings.

Project migration issues, compatibility gaps, and low usage are some of the issues that are hindering the adoption of Amazon's AI chips. The situation also puts at risk the huge revenue that Amazon earns from its cloud business. According to Business Insider, Amazon's challenges were identified through confidential filings and sources familiar with the matter.

Trainium and Inferentia, Amazon's top-of-the-line in-house chip designs, debuted late last year. The publication reports that last year, Trainium usage among AWS customers amounted to just 0.5% of that of Nvidia's graphics processing units.

According to the report, Amazon evaluated the usage share of different AI chips across its AWS services in April 2024. Inferentia fared slightly better, with an adoption rate of 2.7%. Inferentia is a chip designed for inference, the AI task in which an end user runs a trained model to produce results. The report cites an internal document stating:

"Early customer attempts exposed some friction points and hindered adoption."

The statement above refers to the challenges large cloud customers face when transitioning to Amazon's custom chips. The report identifies the pull of Nvidia's CUDA platform, which customers find more attractive, as a key reason.

AWS, the world's largest cloud service provider, is developing its own computer chips to streamline its operations. Amazon sometimes touts these AI chip efforts, but the picture painted by the internal documents differs from the company's public messaging.

Internal filings say the company is grappling with slow adoption, but Amazon's CEO has a different view. During the Q1 earnings call, Amazon CEO Andy Jassy said that demand for AWS chips is high.

"We have the broadest selection of Nvidia compute instances, but demand for our custom silicon, Trainium and Inferentia, is quite high given its price-performance advantage over available alternatives."

Andy Jassy also mentioned early adopters of AWS silicon in a letter to investors: "We already have multiple customers using our AI chips, including Anthropic, Airbnb, Hugging Face, Qualtrics, Ricoh, and Snap." Anthropic, meanwhile, is a different story altogether, as Amazon is the startup's biggest backer. The cloud computing giant has invested $4 billion in Anthropic, under an agreement that requires the startup to use AWS-designed silicon.

Amazon Web Services offers a wide range of processors, from Nvidia's Grace Hopper chips to parts from AMD and Intel. But it profits most from designing its own data center chips, which save costs by avoiding the need to buy GPUs from Nvidia.

Amazon launched its first AI chip, Inferentia, in 2018, but Nvidia still leads in delivering solutions that are widely adopted across industries. AWS, Microsoft, and Google are among Nvidia's largest customers; all of these giants rent out GPUs to customers through their cloud services.

In March, AWS CEO Adam Selipsky attended Nvidia's GTC conference, where the two companies issued a joint statement focused on their strategic collaboration in advancing generative AI.

"The deep collaboration between our two companies dates back 13 years when we jointly launched the world's first GPU cloud instance on AWS, and today we offer our customers the broadest range of NVIDIA GPU solutions."

Developers often favor Nvidia's CUDA platform: Nvidia has spent years of time and effort building it, the industry has standardized on it, and it makes their work easier. Amazon, by contrast, is still working through this puzzle by trial and error.

Reference Links

https://www.channelnewsasia.com/business/amazon-racing-develop-ai-chips-cheaper-faster-nvidias-executives-say-4505146
