Intel’s Gaudi 3 AI accelerator has been a significant advancement in AI hardware, previously available primarily in the OAM (Open Accelerator Module) form factor. The introduction of the PCIe version marks a pivotal shift, enabling broader adoption and integration into existing enterprise infrastructures. What Is Intel Gaudi 3 PCIe? The Intel Gaudi 3 PCIe (HL-338) […]

Read More

The rise of large language models (LLMs) has driven significant demand for efficient inference and fine-tuning frameworks. One such framework, vLLM, is optimised for high-performance serving with PagedAttention, allowing for memory-efficient execution across diverse hardware architectures. With the introduction of new AI accelerators such as Gaudi3, H200, and MI300X, optimising fine-tuning parameters is essential to […]

Read More