NVIDIA L40S 48GB

P/N: 900-2G133-0080-000

10€ (excl. TAX)

Delivery is made within 14-21 days
Warranty 1 year
Free 14 day returns

Pricing for

In stock

Guaranteed Safe Checkout:

NVIDIA L40S 48GB. Enhanced Ada Lovelace accelerator for AI workloads with 48GB ECC. Direct import, 1–3y warranty, enterprise invoices, fast delivery across EU.

Categories: Graphics cards (GPUs)

Pairs well with

Revolutionizing the Digital Landscape

Tech Trends: WordPress Leading the Charge

WordPress Innovations: Future-Proofing Your Digital Presence

The Impact of WordPress on Modern Technology

Additional information

Weight	302025141500-0400 kg
Dimensions	252025141500-0400 cm
Country of manufacture	Taiwan
Manufacturer's warranty (years)	1
Model	NVIDIA L40S
Cache L2 (MB)	96
Cache L1 (KB)	128
Process technology (nm)	4
Memory type	GDDR6 ECC
Graphics Processing Unit (Chip)	AD102
Number of CUDA cores	18176
Number of Tensor cores	568
Number of RT cores	142
GPU Frequency (MHz)	1110
GPU Boost Frequency (MHz)	2520
Video memory size (GB)	48
Memory bus width (bits)	384
Memory Bandwidth (GB/s)	864
Connection interface (PCIe)	PCIe 4.0 x16
FP16 performance (TFLOPS)	733
FP32 performance (TFLOPS)	366
NVENC / NVDEC number of blocks	3×NVDEC, 3×NVENC
Cooling type	Passive (server module)
Number of occupied slots (pcs)	2
Length (cm)	Fri Jul 25 2025 14:15:00 GMT-0400 (Eastern Daylight Time)
Weight (kg)	Wed Apr 30 2025 14:15:00 GMT-0400 (Eastern Daylight Time)
Temperature range (°C)	0–85

Product description

NVIDIA L40S 48GB graphics card — accelerator for AI and visualisation

NVIDIA L40S 48GB graphics card — a server accelerator based on the Ada Lovelace architecture, designed specifically for generative AI tasks, large language model (LLM) inference, and professional visualisation. This card combines high computing power and versatility, allowing it to be used for both neural network training and inference, as well as for graphics applications or video processing.

Key features

Architecture: Ada Lovelace
CUDA cores: 18,176
Tensor Cores (4th generation): 568
RT cores (3rd generation): 142
Memory: 48 GB GDDR6 ECC
Memory bandwidth: 864 GB/s
Interface: PCIe Gen4 x16
Power consumption: up to 350 W
Codecs: 3× NVENC and 3× NVDEC (AV1 support)
Video outputs: 4× DisplayPort 1.4a
vGPU support: NVIDIA AI Enterprise, RTX Virtual Workstation
Cooling: passive, requires a server with airflow

What does NVIDIA L40S performance offer in practice?

Lag-free LLM operation. The card allows large language models (e.g., GPT-class) to be served faster and more stably, which is important for chatbots, analytics, and recommendation systems.
Generative AI. With twice the tensor power of the L40, the L40S is ideal for working with images, videos, and text, where billions of parameters are involved.
Graphics and video processing. Support for three NVENC/NVDEC with AV1 accelerates high-quality video streaming, encoding, and playback.
Balance between server and visual tasks. Unlike the H100, which is designed almost exclusively for AI, the L40S can handle both visualisation and generative tasks equally well.
Resource savings. With power consumption of up to 350 watts, the L40S delivers high performance per watt, making it a cost-effective choice for data centres and integrators.

What’s special about the NVIDIA L40S graphics card?

Versatility: the card is equally effective at AI and graphics tasks, unlike the H100/H200, which are designed exclusively for AI.
Higher performance compared to NVIDIA L40 — twice the tensor power (FP8 and FP16), which is critical for generative AI and LLM inference.
Server reliability: ECC memory and passive cooling for round-the-clock operation.
vGPU support allows resources to be shared between multiple virtual machines — beneficial for data centres and cloud solutions.
AV1 codecs make the card suitable for video processing, streaming and visualisation tasks.

Comparison with other models

L40S vs L40: the basic L40 consumes less power (300 W), but the L40S is almost twice as powerful in tensor operations, making it preferable for generative AI.
L40S vs RTX 6000 Ada 48GB: both cards use Ada Lovelace architecture and 48 GB of memory, but the RTX 6000 is a ‘workstation’ for designers and 3D visualisation, while the L40S is a server solution for AI and data centres.
L40S vs H100 80GB: The H100 remains the absolute leader in AI performance, but it comes at a higher price. The L40S is more cost-effective when a balance between performance and cost is required.