Best Of 2025: Red Hat Delivers Distributed Computing Platform For Running Ai Applications
Red Hat, Friday, December 26th, 2025
Red Hat today moved to reduce the complexity of building and deploying artificial intelligence (AI) applications by updating a platform that combines an inference server with an application development and deployment framework based on Kubernetes and a distribution of Linux optimized for these types of applications.
The Red Hat AI 3 platform is based on Red Hat AI Inference Server, Red Hat Enterprise Linux AI (RHEL AI) and Red Hat OpenShift AI in a way that promises to make it simpler for IT teams to mix and match AI accelerators and large language models (LLMs) as they best see fit.
At the core of that effort is vllm-d, a project originally developed at the Sky Computing Lab hosted by the University of California at Berkeley that is based on the CUDA kernels developed by NVIDIA.