intermediate → advancedPRO

NVIDIA AI Development

From CUDA basics to deploying with TensorRT-LLM, NeMo, and NVIDIA NIM microservices.

12 hours6 modules

Coming Soon

Understand GPU threads, memory hierarchy, and CUDA kernels.

Coming Soon

Optimise LLM inference with TensorRT-LLM for production throughput.

Coming Soon

Fine-tune and customise foundation models with the NeMo toolkit.

Coming Soon

Serve models at scale with NVIDIA Triton Inference Server.

Coming Soon

Deploy optimised AI microservices with NVIDIA NIM containers.

Coming Soon

Provision and manage cloud-based DGX infrastructure for large-scale training.

Get The AI Brief

Join thousands of AI professionals. The week's most important stories, every Monday.