NVIDIA Launches New GPUs And Services for Generative AI Inferencing

  • 📰 ForbesTech


The star of the show is a new PCIe card with 12 times more inference throughput for large models like ChatGPT.

Jensen also provided a quick update on the Arm-based Grace CPU, which is now sampling to eight major server OEMs, including Hewlett Packard Enterprise. Early performance and power consumption results look promising: 20-30% better performance than “Next-Gen” x86, with 70% better power efficiency.

Building a custom large language model for a business is a complex endeavor, requiring lots of data, expertise, and a ton of hardware. To help ease the journey, NVIDIA added two Foundations based on NeMo for illustrations and biomedical engineering. NVIDIA also provides customers with expert consultation and assistance throughout the development and deployment process....

To round out the picture, Jensen pointed to the now-available BlueField-3 DPU. Oracle Cloud Infrastructure is using the BlueField-3 to offload management from CPUs, and of course the supercomputing community, which has long preferred InfiniBand over Ethernet, is leading the adoption of this high-performance DPU. Oracle Cloud is the first CSP to deploy NVIDIA’s DGX servers, which offer more performance than the HGX servers most CSPs choose.

 

