Intel’s Gaudi 3 AI accelerator has been a significant advancement in AI hardware, previously available primarily in the OAM (Open Accelerator Module) form factor. The introduction of the PCIe version marks a pivotal shift, enabling broader adoption and integration into existing enterprise infrastructures. What Is Intel Gaudi 3 PCIe? The Intel Gaudi 3 PCIe (HL-338) […]

Read More

The rise of large language models (LLMs) has driven significant demand for efficient inference and fine-tuning frameworks. One such framework, vLLM, is optimised for high-performance serving with PagedAttention, allowing for memory-efficient execution across diverse hardware architectures. With the introduction of new AI accelerators such as Gaudi3, H200, and MI300X, optimising fine-tuning parameters is essential to […]

Read More

Artificial Intelligence (AI) has transformed the way we interact with technology, enabling automation, decision-making, and predictive analytics across various industries. At the core of AI development are different learning methodologies that dictate how models learn from data. In this blog, we will explore the key learning methods used in AI, their typical applications, how they […]

Read More

The AI hardware market is rapidly evolving, driven by the increasing complexity of AI workloads. DeepSeek, a new large-scale AI model from China, has entered the scene, but its impact on the broader AI landscape remains an open question. Is it simply a competitor to OpenAI’s ChatGPT, or does it have wider implications for inferencing, […]

Read More