New Technology / AI Development
DeepInfra's Strategic Funding for AI Inference
DeepInfra has secured $107 million in Series B funding, primarily from Nvidia and Samsung, to enhance its AI inference cloud platform. This investment aims to improve compute efficiency and address supply chain challenges in chip availability.
Source material: Nvidia Backs DeepInfra in $107 Million Raise
Summary
The company focuses on utilizing Nvidia's hardware and optimizing software to lower costs associated with AI inference. DeepInfra is currently processing five trillion tokens weekly and plans to expand its platform across the US, Europe, and Asia.
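The five-trillion-tokens-per-week figure works out to roughly 8.3 million tokens per second on average, a quick back-of-envelope check:

```python
# Back-of-envelope throughput from the article's figure of
# five trillion tokens processed per week.
tokens_per_week = 5_000_000_000_000
seconds_per_week = 7 * 24 * 60 * 60  # 604,800 seconds
tokens_per_second = tokens_per_week / seconds_per_week
print(f"{tokens_per_second:,.0f} tokens/second")  # roughly 8.3 million
```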
Collaboration with Samsung is crucial for overcoming supply chain issues, particularly around high-bandwidth memory. Demand for chips, including CPUs and GPUs, is expected to rise significantly as inference needs grow.
Perspectives
DeepInfra's Position
- Claims Nvidia's hardware is the most efficient for AI inference
- Highlights the importance of specialized infrastructure for efficient inference
Cerebras as a Competitor
- Notes Cerebras aims to compete with Nvidia in AI inference
- Questions how DeepInfra differentiates itself from Nvidia's offerings
Neutral / Shared
- Acknowledges the significant demand for AI inference in the future
- Mentions ongoing supply chain challenges affecting chip availability
Key developments
Phase 1
DeepInfra has raised $107 million in Series B funding, primarily from Nvidia and Samsung, to enhance its AI inference cloud platform. The company aims to improve compute efficiency and address supply chain challenges in chip availability.
- The company aims to optimize AI compute efficiency by utilizing Nvidia's hardware and by focusing on software improvements and caching strategies to lower costs
- Currently processing five trillion tokens weekly, DeepInfra plans to use the new funding to expand its platform in the US, Europe, and Asia, increasing the deployment of Nvidia chips
- The collaboration with Samsung aims to overcome supply chain issues related to chip shortages, particularly in high-bandwidth memory, which is essential for scaling inference capabilities
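The article does not detail which caching strategies DeepInfra uses. As a minimal illustration of the general idea, caching results for repeated requests so identical prompts skip the expensive model call, here is a sketch in which `model_call` is a hypothetical stub standing in for a real inference request:

```python
from functools import lru_cache

CALLS = 0  # counts how many times the "expensive" call actually runs

def model_call(prompt: str) -> str:
    """Stub standing in for an expensive inference request."""
    global CALLS
    CALLS += 1
    return prompt.upper()  # placeholder "generation"

@lru_cache(maxsize=1024)
def cached_generate(prompt: str) -> str:
    # Identical prompts hit the cache; model_call runs only once per prompt.
    return model_call(prompt)

cached_generate("hello")
cached_generate("hello")  # served from cache; no second model_call
```

Production inference caches are more involved (e.g. caching attention key/value states for shared prompt prefixes rather than whole responses), but the cost-saving principle is the same: avoid recomputing work for repeated input.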