Welcome to OsoDose Store!

Products

NVIDIA L40S 48GB

P/N: 900-2G133-0080-000

10 (excl. TAX)

  • Delivery is made within 14-21 days

  • Warranty 1 year

  • Free 14 day returns

Pricing for

In stock

1010
1
Addons Price: 0
Total: 10

Guaranteed Safe Checkout:

Nvidia

NVIDIA L40S 48GB. Enhanced Ada Lovelace accelerator for AI workloads with 48GB ECC. Direct import, 1–3y warranty, enterprise invoices, fast delivery across EU.

Additional information

Weight 302025141500-0400 kg
Dimensions 252025141500-0400 cm
Country of manufacture

Taiwan

Manufacturer's warranty (years)

1

Model

NVIDIA L40S

Cache L2 (MB)

96

Cache L1 (KB)

128

Process technology (nm)

4

Memory type

GDDR6 ECC

Graphics Processing Unit (Chip)

Number of CUDA cores

18176

Number of Tensor cores

568

Number of RT cores

142

GPU Frequency (MHz)

1110

GPU Boost Frequency (MHz)

2520

Video memory size (GB)

48

Memory bus width (bits)

384

Memory Bandwidth (GB/s)

864

Connection interface (PCIe)

PCIe 4.0 x16

FP16 performance (TFLOPS)

733

FP32 performance (TFLOPS)

366

NVENC / NVDEC number of blocks

3×NVDEC, 3×NVENC

Cooling type

Passive (server module)

Number of occupied slots (pcs)

2

Length (cm)

Fri Jul 25 2025 14:15:00 GMT-0400 (Eastern Daylight Time)

Weight (kg)

Wed Apr 30 2025 14:15:00 GMT-0400 (Eastern Daylight Time)

Temperature range (°C)

0–85

Product description

NVIDIA L40S 48GB graphics card — accelerator for AI and visualisation

NVIDIA L40S 48GB graphics card — a server accelerator based on the Ada Lovelace architecture, designed specifically for generative AI tasks, large language model (LLM) inference, and professional visualisation. This card combines high computing power and versatility, allowing it to be used for both neural network training and inference, as well as for graphics applications or video processing.

Key features

  • Architecture: Ada Lovelace
  • CUDA cores: 18,176
  • Tensor Cores (4th generation): 568
  • RT cores (3rd generation): 142
  • Memory: 48 GB GDDR6 ECC
  • Memory bandwidth: 864 GB/s
  • Interface: PCIe Gen4 x16
  • Power consumption: up to 350 W
  • Codecs: 3× NVENC and 3× NVDEC (AV1 support)
  • Video outputs: 4× DisplayPort 1.4a
  • vGPU support: NVIDIA AI Enterprise, RTX Virtual Workstation
  • Cooling: passive, requires a server with airflow

What does NVIDIA L40S performance offer in practice?

  • Lag-free LLM operation. The card allows large language models (e.g., GPT-class) to be served faster and more stably, which is important for chatbots, analytics, and recommendation systems.
  • Generative AI. With twice the tensor power of the L40, the L40S is ideal for working with images, videos, and text, where billions of parameters are involved.
  • Graphics and video processing. Support for three NVENC/NVDEC with AV1 accelerates high-quality video streaming, encoding, and playback.
  • Balance between server and visual tasks. Unlike the H100, which is designed almost exclusively for AI, the L40S can handle both visualisation and generative tasks equally well.
  • Resource savings. With power consumption of up to 350 watts, the L40S delivers high performance per watt, making it a cost-effective choice for data centres and integrators. 

What’s special about the NVIDIA L40S graphics card?

  1. Versatility: the card is equally effective at AI and graphics tasks, unlike the H100/H200, which are designed exclusively for AI.
  2. Higher performance compared to NVIDIA L40 — twice the tensor power (FP8 and FP16), which is critical for generative AI and LLM inference.
  3. Server reliability: ECC memory and passive cooling for round-the-clock operation.
  4. vGPU support allows resources to be shared between multiple virtual machines — beneficial for data centres and cloud solutions.
  5. AV1 codecs make the card suitable for video processing, streaming and visualisation tasks. 

Comparison with other models

  • L40S vs L40: the basic L40 consumes less power (300 W), but the L40S is almost twice as powerful in tensor operations, making it preferable for generative AI.
  • L40S vs RTX 6000 Ada 48GB: both cards use Ada Lovelace architecture and 48 GB of memory, but the RTX 6000 is a ‘workstation’ for designers and 3D visualisation, while the L40S is a server solution for AI and data centres.
  • L40S vs H100 80GB: The H100 remains the absolute leader in AI performance, but it comes at a higher price. The L40S is more cost-effective when a balance between performance and cost is required.

What is NVIDIA L40S 48GB suitable for?

  • Generative AI (text, images, video);
  • Inference and training of medium and large LLM models;
  • Workstation virtualisation with RTX vWS;
  • engineering graphics and rendering;
  • broadcasting and video processing in AV1.

Why it is beneficial to buy NVIDIA L40S from us

  • Direct delivery from the USA — authenticity guaranteed.
  • 3-year warranty — confidence in reliability.
  • Flexible payment methods: secure payment with VAT/ex VAT, bank cards, USDT.
  • Great price with maximum versatility.

Buy NVIDIA L40S 48GB — get an accelerator that covers several areas at once: from AI computing to professional visualisation and virtualisation.

Product reviews

0
0 reviews
0% average rating
5
0
4
0
3
0
2
0
1
0

Reviews

There are no reviews yet.

Only logged in customers who have purchased this product may leave a review.

Product Benchmark

Payment Methods

Shipping Methods

FAQ

Customers Also Loved

Gallery

Here goes your text ... Select any part of your text to access the formatting toolbar.

I found 147 items that matched your query "".