Nvidia's annual GTC conference begins next week, featuring a keynote by CEO Jensen Huang focused on the company's vision for the future of computing and AI. The event is expected to cover announcements across both software, like a potential open-source platform for enterprise AI agents, and hardware, including a rumored new chip designed to accelerate AI inference.
The conference will also address Nvidia's strategic moves, including its relationship with inference company Groq, following a major technology licensing deal. Broader themes of the three-day event include the evolution of AI across various industries such as healthcare, robotics, and autonomous vehicles.
Main topics: Nvidia's GTC conference, expected software and hardware announcements (AI agents platform, inference chip), AI industry applications, and strategic partnerships including the Groq relationship.
Nvidia kicks off its annual GTC developer conference in San Jose next week with CEO Jensen Huang’s keynote scheduled for Monday at 11am PT / 2pm ET.
GTC — which stands for GPU Technology Conference — is Nvidia’s flagship annual event, where the chipmaker typically uses the spotlight to announce new products, champion partnerships, and lay out its vision for the future of computing. Huang’s keynote will focus on Nvidia’s role in the future of computing and AI. You can watch the two-hour address in person at the SAP Center or livestream the talk on the event’s website.
The broader three-day event is focused on what’s coming next for AI across industries including healthcare, robotics, and autonomous vehicles, among others.
What to expect
On the software side, it’s rumored that Nvidia will release an open source platform for enterprise AI agents, dubbed NemoClaw, as originally reported by Wired. The platform would give businesses a structured way to build and deploy AI agents (software that can carry out multi-step tasks autonomously) and would position Nvidia to mirror similar offerings from companies like OpenAI.
On the hardware side, the company is also rumored to be releasing a new chip designed to accelerate the AI inference process — the process by which an AI model applies what it has learned to generate responses or make decisions, as distinct from the initial training process, which requires far more computing power. Faster, cheaper inference is widely seen as one of the last bottlenecks to scaling AI applications broadly. The chip, if confirmed, would represent Nvidia’s latest bid to dominate not just the training market, where it already commands an estimated 80% share, but the inference market as well, where competition from custom chips built by Google, Amazon and others is fast intensifying.
Kevin Cook, a senior equity strategist at Zacks Investment Research, told TechCrunch that attendees should also expect to learn what the company plans to do with its relationship with Groq, the inference company Nvidia reportedly paid $20 billion late last year to license its technology. There’s a lot of curiosity around this tie-up, given that Jonathan Ross, Groq’s founder, Sunny Madra, Groq’s President, and other members of the Groq team agreed to join Nvidia to help advance and scale that licensed tech.
There will, of course, also be a range of partnership announcements and demonstrations showcasing Nvidia’s AI capabilities across industries.