As enterprises rapidly adopt AI to improve efficiency, customer experience, and innovation, the choice of model and serving stack has become a critical factor. Whether deploying a massive large language model (LLM), serving one through an efficient inference engine such as vLLM, or opting for a compute-friendly small language model (SLM), organisations are increasingly strategic about balancing performance, cost, and accuracy. […]


The rise of large language models (LLMs) has driven significant demand for efficient inference and fine-tuning frameworks. One such framework, vLLM, is optimised for high-performance serving with PagedAttention, enabling memory-efficient execution across diverse hardware architectures. With the introduction of new AI accelerators such as Intel Gaudi 3, NVIDIA H200, and AMD MI300X, optimising fine-tuning parameters is essential to […]
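The memory-efficiency claim above comes from how PagedAttention manages the KV cache: instead of reserving one large contiguous buffer per sequence, each sequence's cache is split into fixed-size blocks allocated on demand from a shared pool. The following is an illustrative sketch of that paging idea, not vLLM's actual implementation; the class names, the pool size, and the block size of 4 are assumptions chosen for clarity (vLLM's default block size is larger).

```python
# Illustrative sketch (not vLLM's implementation): PagedAttention-style
# KV-cache paging. Each sequence maps logical token positions to physical
# blocks drawn from a shared pool, so memory is claimed incrementally
# rather than reserved up front for the maximum sequence length.

BLOCK_SIZE = 4  # tokens per block; small here for readability


class BlockPool:
    """Shared pool of physical KV-cache blocks."""

    def __init__(self, num_blocks: int):
        self.free = list(range(num_blocks))

    def allocate(self) -> int:
        if not self.free:
            raise MemoryError("KV-cache pool exhausted")
        return self.free.pop()

    def release(self, block_id: int) -> None:
        self.free.append(block_id)


class SequenceCache:
    """Per-sequence block table: logical block index -> physical block id."""

    def __init__(self, pool: BlockPool):
        self.pool = pool
        self.block_table: list[int] = []
        self.num_tokens = 0

    def append_token(self) -> None:
        # Allocate a new physical block only when the current one is full
        # (or when the sequence has no blocks yet).
        if self.num_tokens % BLOCK_SIZE == 0:
            self.block_table.append(self.pool.allocate())
        self.num_tokens += 1

    def free(self) -> None:
        # Return all blocks to the pool for reuse by other sequences.
        for block_id in self.block_table:
            self.pool.release(block_id)
        self.block_table.clear()
        self.num_tokens = 0


pool = BlockPool(num_blocks=8)
seq = SequenceCache(pool)
for _ in range(10):  # 10 generated tokens -> ceil(10/4) = 3 blocks
    seq.append_token()
print(len(seq.block_table))  # 3 blocks in use
print(len(pool.free))        # 5 blocks left for other sequences
```

Because blocks are released back to the pool as soon as a sequence finishes, many concurrent requests can share the same fixed memory budget, which is the core of the memory-efficient serving the excerpt describes.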
