NVIDIA catalog
Models, skills and blueprints for GPU jobs.
Browse NVIDIA workloads inside ICPX before creating a compute job.
nvidia
Build a Video Search and Summarization (VSS) Agent
Run the VSS Blueprint on your Spark
nvidia
Build and Deploy a Multi-Agent Chatbot
Deploy a multi-agent chatbot system and chat with agents on your Spark
nvidia
CLI Coding Agent
Build local CLI coding agents with Ollama
nvidia
ComfyUI
Install and use ComfyUI to generate images
nvidia
Connect Multiple DGX Spark through a Switch
Set up a cluster of DGX Spark devices connected through a switch
nvidia
Connect Three DGX Spark in a Ring Topology
Connect and set up three DGX Spark devices in a ring topology
nvidia
Connect Two Sparks
Connect two Spark devices and set them up for inference and fine-tuning
nvidia
CUDA-X Data Science
Install and use NVIDIA cuML and NVIDIA cuDF to accelerate UMAP, HDBSCAN, pandas and more with zero code changes
nvidia
cuTile Kernels
Run cuTile kernel benchmarks, FMHA implementation, and LLM inference on DGX Spark and B300
nvidia
DGX Dashboard
Monitor your DGX system and launch JupyterLab
nvidia
DGX Station AI Skills for Coding Agents
Give your coding agent (Claude Code, Codex, Gemini CLI, Cursor) DGX Station expertise via an AGENTS.md and on-demand Agent Skills
nvidia
Fine-tune with NeMo
Use NVIDIA NeMo to fine-tune models locally
nvidia
Fine-tune with PyTorch
Use PyTorch to fine-tune models locally
nvidia
FLUX.1 Dreambooth LoRA Fine-tuning
Fine-tune FLUX.1-dev 12B model using Dreambooth LoRA for custom image generation
nvidia
How to Build a Multi-GPU AI PC - A Practical Guide
Many people explore local generative AI for privacy and to avoid token limits, but newer models require significant memory and compute—leading some to adopt multi-GPU setups.
nvidia
How to Fine-Tune an LLM on NVIDIA GPUs With Unsloth
Fine-tune popular AI models faster in Unsloth with NVIDIA RTX AI PCs, RTX PRO workstations, and DGX Spark—plus explore the new Nemotron Nano 3 family of open models.
nvidia
How to Get Started With Large Language Models on NVIDIA RTX PCs
Learn about using LLMs locally on PCs and workstations with Ollama, AnythingLLM, and LM Studio.
nvidia
How to Get Started With Visual Generative AI on NVIDIA RTX PCs
Learn how to run advanced image and video generation locally with ComfyUI and LTX-2 on RTX PCs.
nvidia
Image & Video Generation with ComfyUI
Generate images and videos with FLUX, Wan 2.1, HunyuanVideo, and Cosmos on DGX Station
nvidia
Install and Use Isaac Sim and Isaac Lab
Build Isaac Sim and Isaac Lab from source for Spark
nvidia
Isaac GR00T N1.6 Fine-Tuning
Fine-tune and benchmark NVIDIA's GR00T N1.6 robotics foundation model on DGX Station
nvidia
Live VLM WebUI
Real-time Vision Language Model interaction with webcam streaming
nvidia
LLaMA Factory
Install and fine-tune models with LLaMA Factory
nvidia
LLM Inference with SGLang
Serve LLMs with SGLang on DGX Station (Qwen3-8B default; Qwen3.6 MoE optional)—prefix-cached multi-turn, structured output, benchmarks, and inference-server guidance
nvidia
LM Studio on DGX Spark
Deploy LM Studio and serve LLMs on a Spark device; use LM Link to access models remotely.
nvidia
Local Coding Agent
Run local CLI coding agents with Ollama on DGX Station (NVIDIA GB300) using glm-4.7-flash (fast) or unsloth/GLM-4.7-GGUF:Q8_0 (best quality)
nvidia
Local Healthcare Agent on DGX Station
Run healthcare AI agents that analyze patient data and predict protein structures in an OpenShell sandbox on DGX Station
nvidia
MIG on DGX Station
Enable and configure Multi-Instance GPU (MIG) on DGX Station with GB300 Ultra (B300 GPUs)
nvidia
Multi-modal Inference
Set up multi-modal inference with TensorRT
nvidia
Nanochat on Dual-Spark
Set up Nanochat on Dual-Spark
nvidia
Nanochat Training
Train a small ChatGPT-style LLM (nanochat) with tokenizer, pretraining, midtraining, and SFT on DGX Station with GB300 Ultra
nvidia
NCCL for Two Sparks
Install and test NCCL on two Sparks
nvidia
Nemotron-3-Nano with llama.cpp
Run Nemotron-3-Nano-30B model using llama.cpp on DGX Spark
nvidia
NIM on Spark
Deploy a NIM on Spark
nvidia
NVFP4 Pretraining with Megatron Bridge
Pretrain Llama 3.1 8B with NVFP4 mixed precision on DGX Station using Megatron Bridge
nvidia
NVFP4 Quantization
Quantize a model to NVFP4 to run on DGX Station using TensorRT Model Optimizer
nvidia
NVFP4 Quantization
Quantize a model to NVFP4 to run on Spark using TensorRT Model Optimizer
nvidia
NVIDIA Video Generation Guide
Learn how to take control of visual generative AI and create high-resolution videos with LTX-2 in ComfyUI, accelerated on RTX.
nvidia
Open WebUI with Ollama
Install Open WebUI and use Ollama to chat with models on your Spark
nvidia
OpenClaw 🦞
Run OpenClaw locally on DGX Spark with a vLLM-served local model
nvidia
Optimized JAX
Optimize JAX to run on Spark
nvidia
Portfolio Optimization
GPU-accelerated portfolio optimization using cuOpt and cuML
nvidia
Profiler-Driven Kernel Optimization for Fine-Tuning
Use torch.profiler to find training bottlenecks, then write custom Triton kernels to optimize LLaMA 8B fine-tuning
nvidia
RAG Application in AI Workbench
Install and use AI Workbench to clone and run a reproducible RAG application
nvidia
Register DGX Spark to Brev
Link your DGX Spark to Brev for remote access and shared environments
nvidia
Register DGX Station to Brev
Link your DGX Station to Brev for remote access and sharing
nvidia
Run Hermes Agent with Local Models
Install and run the Hermes self-improving AI agent on DGX Spark.
nvidia
Run models with llama.cpp on DGX Spark
Build llama.cpp with CUDA and serve models via an OpenAI-compatible API
nvidia
Run NemoClaw with a Local LLM
Build your first local AI assistant on DGX Station using NemoClaw in a secure sandbox, with optional Telegram.
nvidia
Run NemoClaw with a Local LLM
Build your first local AI assistant on DGX Spark using NemoClaw and Ollama in a secure sandbox, with optional Telegram.
nvidia
Run OpenClaw For Free On NVIDIA RTX GPUs & DGX Spark
Learn how to set up and host the popular AI agent using local inference apps optimized for RTX.
nvidia
Secure Long Running AI Agents with OpenShell on DGX Station
Run OpenClaw with local models in an NVIDIA OpenShell sandbox on DGX Station
nvidia
Secure Long Running AI Agents with OpenShell on DGX Spark
Run OpenClaw with local models in an NVIDIA OpenShell sandbox on DGX Spark
nvidia
Set Up Local Network Access
NVIDIA Sync helps set up and configure SSH access
nvidia
Set up Tailscale on Your Spark
Use Tailscale to connect to your Spark on your home network no matter where you are
nvidia
SGLang for Inference
Install and use SGLang on DGX Spark
nvidia
Single-cell RNA Sequencing
An end-to-end GPU-powered workflow for scRNA-seq using RAPIDS
nvidia
Spark & Reachy Photo Booth
AI-augmented photo booth using the DGX Spark and Reachy Mini.
nvidia
Speculative Decoding
Learn how to set up speculative decoding for fast inference on Spark
nvidia
Text to Knowledge Graph
Transform unstructured text into interactive knowledge graphs with LLM inference and graph visualization
nvidia
Text to Knowledge Graph on DGX Station
Transform unstructured text into interactive knowledge graphs with LLM inference and graph visualization
nvidia
Topic Modeling
Extract insights from massive text datasets using cuML's GPU-accelerated BERTopic
nvidia
TRT LLM for Inference
Install and use TensorRT-LLM on DGX Spark
nvidia
Unsloth on DGX Spark
Optimized fine-tuning with Unsloth
nvidia
Vibe Coding in VS Code
Use DGX Spark as a local or remote Vibe Coding assistant with Ollama and Continue
nvidia
Vision-Language Model Fine-tuning
Fine-tune Vision-Language Models for image and video understanding tasks using Qwen2.5-VL and InternVL3
nvidia
vLLM for Inference
Install and use vLLM on DGX Spark
nvidia
vLLM for Inference
Install and use vLLM on DGX Station
nvidia
vLLM for Inference
Install and use vLLM on NVIDIA RTX Pro 6000
nvidia
VS Code
Install and use VS Code locally or remotely
nvidia
🦞 Set Up Example NemoClaw Agents 🦞
Ready-to-run application examples for your NemoClaw sandbox — policy, prompt, and personalization for each workflow