GPU Cluster Archives - High-Performance Computing News Analysis | insideHPC https://insidehpc.com/category/gpu-cluster/ At the Convergence of HPC, AI and Quantum Thu, 18 Apr 2024 20:49:37 +0000 en-US hourly 1 https://wordpress.org/?v=6.5.3 57143778 Lenovo and NVIDIA Partner on New Hybrid AI Solutions https://insidehpc.com/2024/04/lenovo-and-nvidia-partner-on-new-hybrid-ai-solutions/ https://insidehpc.com/2024/04/lenovo-and-nvidia-partner-on-new-hybrid-ai-solutions/#respond Thu, 18 Apr 2024 16:32:48 +0000 https://insidehpc.com/?p=93877

[SPONSORED GUEST ARTICLE] Lenovo unveiled the expansion of its ThinkSystem AI portfolio designed for the most demanding AI, data analytics and HPC workloads, featuring two powerful 8-way NVIDIA GPU systems combining massive computational capabilities with power efficiency. Engineered for generative AI, the Lenovo systems incorporate support for the NVIDIA HGX AI supercomputing platform, including NVIDIA H100 and H200 Tensor Core GPUs and the new Grace Blackwell....

The post Lenovo and NVIDIA Partner on New Hybrid AI Solutions appeared first on High-Performance Computing News Analysis | insideHPC.

]]>
https://insidehpc.com/2024/04/lenovo-and-nvidia-partner-on-new-hybrid-ai-solutions/feed/ 0 93877
Hammerspace Unveils the Fastest File System in the World for Training Enterprise AI Models at Scale https://insidehpc.com/2024/02/hammerspace-unveils-the-fastest-file-system-in-the-world-for-training-enterprise-ai-models-at-scale/ Wed, 28 Feb 2024 11:00:21 +0000 https://insidehpc.com/?p=93544

[SPONSORED GUEST ARTICLE] Hammerspace, the company orchestrating the Next Data Cycle, unveiled the high-performance NAS architecture needed to address the requirements of broad-based enterprise AI....

The post Hammerspace Unveils the Fastest File System in the World for Training Enterprise AI Models at Scale appeared first on High-Performance Computing News Analysis | insideHPC.

]]>
93544
Federated GPU Infrastructure for AI Workflows https://insidehpc.com/2023/10/federated-gpu-infrastructure-for-ai-workflows/ Mon, 16 Oct 2023 10:00:24 +0000 https://insidehpc.com/?p=92617

[Sponsored Guest Article] With the explosion of use cases such as Generative AI and ML Ops driving tremendous demand for the most advanced GPUs and accelerated computing platforms, there’s never been a better time to explore the “as-a-service” model to help get started quickly.  What could take months of shipping delays and massive CapEx investments can be yours on demand....

The post Federated GPU Infrastructure for AI Workflows appeared first on High-Performance Computing News Analysis | insideHPC.

]]>
92617
HPC and AI Workloads Drive Storage System Design https://insidehpc.com/2023/08/hpc-and-ai-workloads-drive-storage-system-design/ https://insidehpc.com/2023/08/hpc-and-ai-workloads-drive-storage-system-design/#comments Mon, 07 Aug 2023 10:00:59 +0000 https://insidehpc.com/?p=92062

Many organizations are tied to outdated storage systems that cannot meet HPC and AI workload needs. Designing high‑throughput, highly scalable HPC storage systems require expert planning and configuration. The Dell Validated Designs for HPC Storage solution offers a way to quickly upgrade antiquated storage....

The post HPC and AI Workloads Drive Storage System Design appeared first on High-Performance Computing News Analysis | insideHPC.

]]>
https://insidehpc.com/2023/08/hpc-and-ai-workloads-drive-storage-system-design/feed/ 1 92062
GigaIO Introduces 32 GPU Single-Node Supercomputer https://insidehpc.com/2023/07/gigaio-introduces-32-gpu-single-node-supercomputer/ Mon, 17 Jul 2023 13:16:05 +0000 https://insidehpc.com/?p=91935 Carlsbad, California, July 13, 2023 – GigaIO, provider of workload-defined infrastructure for AI and technical computing, recently announced that it successfully configured 32 AMD Instinct MI210 accelerators to a single-node server utilizing the company’s FabreX PCIe memory fabric. Available today, the 32-GPU engineered solution, called SuperNODE, is designed to offer a simplified system capable of […]

The post GigaIO Introduces 32 GPU Single-Node Supercomputer appeared first on High-Performance Computing News Analysis | insideHPC.

]]>
91935
22,000 GPUs: Inflection AI Building 22 exaFLOPS Generative AI Cluster https://insidehpc.com/2023/06/22000-gpus-inflection-ai-building-22-exaflops-generative-ai-cluster/ Fri, 30 Jun 2023 19:51:25 +0000 https://insidehpc.com/?p=91875

Palo Alto-based startup Inflection AI yesterday said it is building the world’s largest AI cluster comprised of 22,000 NVIDIA H100 Tensor Core GPUs that will deliver 22 exaFLOPS performance. The company also said it has raised $1.3 billion in a funding round led by Microsoft, Reid Hoffman, Bill Gates, Eric Schmidt and new investor NVIDIA, […]

The post 22,000 GPUs: Inflection AI Building 22 exaFLOPS Generative AI Cluster appeared first on High-Performance Computing News Analysis | insideHPC.

]]>
91875
Monster API Says Its Platform Cuts AI Development Costs Up to 90% https://insidehpc.com/2023/06/monsterapi-says-its-platform-cuts-ai-development-costs-up-to-90/ Thu, 08 Jun 2023 11:00:44 +0000 https://insidehpc.com/?p=91717

Palo Alto, Calif., June 8, 2023 – Today Monster API is launching its platform to provide developers access to GPU infrastructure and pre-trained AI models at a lower cost than other cloud-based options, designed to deliver ease of use and scalability. It utilizes decentralized computing intended to enable developers to efficiently create AI applications, saving […]

The post Monster API Says Its Platform Cuts AI Development Costs Up to 90% appeared first on High-Performance Computing News Analysis | insideHPC.

]]>
91717
Purdue Announces GPU Expansion of Gilbreth HPC Cluster https://insidehpc.com/2023/04/purdue-announces-gpu-expansion-of-gilbreth-hpc-cluster/ Fri, 28 Apr 2023 19:33:51 +0000 https://insidehpc.com/?p=91461

April 27, 2023, West Lafayette, IN — The Rosen Center for Advanced Computing (RCAC) at Purdue University has added 104 new NVIDIA A100 GPUs to the Gilbreth community HPC cluster. Based on Dell PowerEdge R7525 compute nodes with .5 TB of RAM, two Nvidia A100 Tensor Core GPUs, and 100 Gbps HDR Infiniband, this expansion […]

The post Purdue Announces GPU Expansion of Gilbreth HPC Cluster appeared first on High-Performance Computing News Analysis | insideHPC.

]]>
91461
Microsoft Introduces Generative AI VM on Azure with Scaling up to Thousands of GPUs https://insidehpc.com/2023/03/microsoft-introduces-generative-ai-vm-on-azure-with-scaling-up-to-thousands-of-gpus/ Mon, 13 Mar 2023 19:47:59 +0000 https://insidehpc.com/?p=91163

Microsoft today introduced the ND H100 v5 VM on the Azure cloud, a virtual machine for development generative AI applications. The VM can scale from eight to thousands of NVIDIA H100 GPUs with Quantum-2 InfiniBand networking, Microsoft said, and the adoption of H100’s, NVIDIA’s latest data center GPUs, will accelerate performance for AI models over […]

The post Microsoft Introduces Generative AI VM on Azure with Scaling up to Thousands of GPUs appeared first on High-Performance Computing News Analysis | insideHPC.

]]>
91163
ClearML Certified to Run NVIDIA AI Enterprise Software Suite https://insidehpc.com/2023/03/clearml-certified-to-run-nvidia-ai-enterprise-software-suite/ Tue, 07 Mar 2023 17:00:22 +0000 https://insidehpc.com/?p=91122

Tel Aviv — March 7, 2023 –  ClearML, an open-source MLOps platform, today announced it has been certified to run NVIDIA AI Enterprise, an end-to-end platform for building accelerated production AI. ClearML said the certification makes its MLOps platform more efficient across workflows, enabling optimization of NVIDIA GPUs. It also ensures that ClearML is compatible with and optimized for NVIDIA DGX […]

The post ClearML Certified to Run NVIDIA AI Enterprise Software Suite appeared first on High-Performance Computing News Analysis | insideHPC.

]]>
91122