Models, skills and blueprints for GPU jobs.

NVIDIA blueprintFine-TuningRTXLLMGPU

nvidia

How to Fine-Tune an LLM on NVIDIA GPUs With Unsloth

Fine-tune popular AI models faster in Unsloth with NVIDIA RTX AI PCs, RTX PRO workstations, and DGX Spark—plus explore the new Nemotron Nano 3 family of open models.

NVIDIA blueprintLLMsOllamaRTXAnythingLLM

nvidia

How to Get Started With Large Language Models on NVIDIA RTX PCs

Learn about using LLMs locally on PCs and workstations with Ollama, AnythingLLM, and LM Studio.

NVIDIA blueprintGen AIComfyUILTX-2RTX

nvidia

How to Get Started With Visual Generative AI on NVIDIA RTX PCs

Learn how to run advanced image and video generation locally with ComfyUI and LTX-2 on RTX PCs.

NVIDIA blueprintStationImage GenerationComfyUIDocker

nvidia

Image & Video Generation with ComfyUI

Generate images and videos with FLUX, Wan 2.1, HunyuanVideo, and Cosmos on DGX Station

nvidia

Install and Use Isaac Sim and Isaac Lab

Build Isaac Sim and Isaac Lab from source for Spark

NVIDIA blueprintStationFine-TuningIsaac GR00TBlackwell

nvidia

Isaac GR00T N1.6 Fine-Tuning

Fine-tune and benchmark NVIDIA's GR00T N1.6 robotics foundation model on DGX Station

NVIDIA blueprintVision AIDGXVLMSpark

nvidia

Live VLM WebUI

Real-time Vision Language Model interaction with webcam streaming

nvidia

LLaMA Factory

Install and fine-tune models with LLaMA Factory

NVIDIA blueprintStationRadixAttentionStructured OutputBlackwell

nvidia

LLM Inference with SGLang

Serve LLMs with SGLang on DGX Station (Qwen3-8B default; Qwen3.6 MoE optional)—prefix-cached multi-turn, structured output, benchmarks, and inference-server guidance

NVIDIA blueprintInferencellmsterLM StudioLM Link

nvidia

LM Studio on DGX Spark

Deploy LM Studio and serve LLMs on a Spark device; use LM Link to access models remotely.

NVIDIA blueprintStationCodingOllamaClaude Code

nvidia

Local Coding Agent

Run local CLI coding agents with Ollama on DGX Station (NVIDIA GB300) using glm-4.7-flash (fast) or unsloth/GLM-4.7-GGUF:Q8_0 (best quality)

NVIDIA blueprintStationOpenFold3NemoClawNemotron

nvidia

Local Healthcare Agent on DGX Station

Run healthcare AI agents that analyze patient data and predict protein structures in an OpenShell sandbox on DGX Station

NVIDIA blueprintStationSystem ConfigurationDGX StationMIG

nvidia

MIG on DGX Station

Enable and configure Multi-Instance GPU (MIG) on DGX Station with GB300 Ultra (B300 GPUs)

nvidia

Multi-modal Inference

Setup multi-modal inference with TensorRT

nvidia

Nanochat on Dual-Spark

Setup Nanochat on Dual-Spark

NVIDIA blueprintTrainingnanochatPyTorchDGX Station

nvidia

Nanochat Training

Train a small ChatGPT-style LLM (nanochat) with tokenizer, pretraining, midtraining, and SFT on DGX Station with GB300 Ultra

nvidia

NCCL for Two Sparks

Install and test NCCL on two Sparks

NVIDIA blueprintNemotronInferenceLLMllama.cpp

nvidia

Nemotron-3-Nano with llama.cpp

Run Nemotron-3-Nano-30B model using llama.cpp on DGX Spark

nvidia

NIM on Spark

Deploy a NIM on Spark

NVIDIA blueprintTrainingNVFP4Megatron BridgeStation

nvidia

NVFP4 Pretraining with Megatron Bridge

Pretrain Llama 3.1 8B with NVFP4 mixed precision on DGX Station using Megatron Bridge

NVIDIA blueprintStationDGX

nvidia

NVFP4 Quantization

Quantize a model to NVFP4 to run on DGX Station using TensorRT Model Optimizer

nvidia

NVFP4 Quantization

Quantize a model to NVFP4 to run on Spark using TensorRT Model Optimizer

NVIDIA blueprintComfyUILTX-2RTX

nvidia

NVIDIA Video Generation Guide

Learn how to create videos using LTX-2 in ComfyUI, accelerated on RTX. Learn how to take control of visual generative AI, creating high resolution video on RTX.

nvidia

Open WebUI with Ollama

Install Open WebUI and use Ollama to chat with models on your Spark

NVIDIA blueprintDGXSparkLocal LLMAI Agent

nvidia

OpenClaw 🦞

Run OpenClaw locally on DGX Spark with a vLLM-served local model

nvidia

Optimized JAX

Optimize JAX to run on Spark

NVIDIA blueprintData ScienceRAPIDSFinancial Services

nvidia

Portfolio Optimization

GPU-Accelerated portfolio optimization using cuOpt and cuML

NVIDIA blueprintTrainingFine-TuningPerformance OptimizationKernel Development

nvidia

Profiler-Driven Kernel Optimization for Fine-Tuning

Use torch.profiler to find training bottlenecks, then write custom Triton kernels to optimize LLaMA 8B fine-tuning

nvidia

RAG Application in AI Workbench

Install and use AI Workbench to clone and run a reproducible RAG application

NVIDIA blueprintDGX SparkBrevSpark

nvidia

Register DGX Spark to Brev

Link your DGX Spark to Brev for remote access and shared environments

NVIDIA blueprintStationDGX StationBrev

nvidia

Register DGX Station to Brev

Link your DGX Station to Brev for remote access and sharing

NVIDIA blueprintNous ResearchLLMAI AgentSpark

nvidia

Run Hermes Agent with Local Models

Install and run the Hermes self-improving AI agent on DGX Spark.

NVIDIA blueprintDGX SparkInferenceLLMllama.cpp

nvidia

Run models with llama.cpp on DGX Spark

Build llama.cpp with CUDA and serve models via an OpenAI-compatible API

NVIDIA blueprintStationTelegramAgentic WorkflowNemoClaw

nvidia

Run NemoClaw with a Local LLM

Build your first local AI assistant on DGX Station using NemoClaw in a secure sandbox, with optional Telegram.

NVIDIA blueprintTelegramDGX SparkAgentic WorkflowNemoClaw

nvidia

Run NemoClaw with a Local LLM

Build your first local AI assistant on DGX Spark using NemoClaw and Ollama in a secure sandbox, with optional Telegram.

NVIDIA blueprintDGX SparkOpenClawRTX

nvidia

Run OpenClaw For Free On NVIDIA RTX GPUs & DGX Spark

Learn how to set up and host the popular AI agent using local inference apps optimized for RTX.

NVIDIA blueprintStationDGX StationOpenShellSecurity

nvidia

Secure Long Running AI Agents with OpenShell on DGX Station

Run OpenClaw with local models in an NVIDIA OpenShell sandbox on DGX Station

NVIDIA blueprintDGXOpenShellSparkSecurity

nvidia

Secure Long Running AI Agents with OpenShell on DGX Spark

Run OpenClaw with local models in an NVIDIA OpenShell sandbox on DGX Spark

nvidia

Set Up Local Network Access

NVIDIA Sync helps set up and configure SSH access

nvidia

Set up Tailscale on Your Spark

Use Tailscale to connect to your Spark on your home network no matter where you are

nvidia

SGLang for Inference

Install and use SGLang on DGX Spark

NVIDIA blueprintdata science

nvidia

Single-cell RNA Sequencing

An end-to-end GPU-powered workflow for scRNA-seq using RAPIDS

nvidia

Spark & Reachy Photo Booth

AI augmented photo booth using the DGX Spark and Reachy Mini.

nvidia

Spark & Reachy Photo Booth

AI augmented photo booth using the DGX Spark and Reachy Mini.

nvidia

Speculative Decoding

Learn how to set up speculative decoding for fast inference on Spark

NVIDIA blueprintGraphRAGKnowledge GraphsNLPDGX

nvidia

Text to Knowledge Graph

Transform unstructured text into interactive knowledge graphs with LLM inference and graph visualization

NVIDIA blueprintGraphRAGKnowledge GraphsNLPOllama

nvidia

Text to Knowledge Graph on DGX Station

Transform unstructured text into interactive knowledge graphs with LLM inference and graph visualization

NVIDIA blueprintData ScienceNLPBERTopicMachine Learning

nvidia

Topic Modeling

Extract insights from massive text datasets using cuML's GPU-accelerated BERTopic

nvidia

TRT LLM for Inference

Install and use TensorRT-LLM on DGX Spark

nvidia

Unsloth on DGX Spark

Optimized fine-tuning with Unsloth

NVIDIA blueprintDGXVibeCodingSpark

nvidia

Vibe Coding in VS Code

Use DGX Spark as a local or remote Vibe Coding assistant with Ollama and Continue

NVIDIA blueprintDGXImage UnderstandingVision-Language ModelsGRPO

nvidia

Vision-Language Model Fine-tuning

Fine-tune Vision-Language Models for image and video understanding tasks using Qwen2.5-VL and InternVL3

nvidia

vLLM for Inference

Install and use vLLM on DGX Spark

NVIDIA blueprintStationvLLMInference

nvidia

vLLM for Inference

Install and use vLLM on DGX Station

NVIDIA blueprintvLLMInferenceRTX

nvidia

vLLM for Inference

Install and use vLLM on NVIDIA RTX Pro 6000

nvidia

VS Code

Install and use VS Code locally or remotely

NVIDIA blueprintPersonal AssistantTelegramApplicationsDGX Spark

nvidia

🦞 Set Up Example NemoClaw Agents 🦞

Ready-to-run application examples for your NemoClaw sandbox — policy, prompt, and personalization for each workflow

nvidia

Build a Video Search and Summarization (VSS) Agent

Run the VSS Blueprint on your Spark

NVIDIA blueprintDGXAgentsSpark

nvidia

Build and Deploy a Multi-Agent Chatbot

Deploy a multi-agent chatbot system and chat with agents on your Spark

NVIDIA blueprintCodingOllamaClaude CodeOpenCode

nvidia

CLI Coding Agent

Build local CLI coding agents with Ollama

nvidia

Comfy UI

Install and use Comfy UI to generate images

nvidia

Connect Multiple DGX Spark through a Switch

Set up a cluster of DGX Spark devices that are connected through Switch

nvidia

Connect Three DGX Spark in a Ring Topology

Connect and set up three DGX Spark devices in a ring topology

nvidia

Connect Two Sparks

Connect two Spark devices and setup them up for inference and fine-tuning

NVIDIA blueprintpandasdimensionality reductiondata analyticsDGX

nvidia

CUDA-X Data Science

Install and use NVIDIA cuML and NVIDIA cuDF to accelerate UMAP, HDBSCAN, pandas and more with zero code changes

NVIDIA blueprintFMHACross-PlatformDeepSeekDocker

nvidia

cuTile Kernels

Run cuTile kernel benchmarks, FMHA implementation, and LLM inference on DGX Spark and B300

nvidia

DGX Dashboard

Monitor your DGX system and launch JupyterLab

NVIDIA blueprintvLLMAI AgentsBlackwellDGX Station

nvidia

DGX Station AI Skills for Coding Agents

Give your coding agent (Claude Code, Codex, Gemini CLI, Cursor) DGX Station expertise via an AGENTS.md and on-demand Agent Skills

nvidia

Fine-tune with NeMo

Use NVIDIA NeMo to fine-tune models locally

nvidia

Fine-tune with Pytorch

Use Pytorch to fine-tune models locally

NVIDIA blueprintImage GenerationComfyUIDGXLoRA

nvidia

FLUX.1 Dreambooth LoRA Fine-tuning

Fine-tune FLUX.1-dev 12B model using Dreambooth LoRA for custom image generation

NVIDIA blueprintComfyUILlama.cppRTX

nvidia

How to Build a Multi-GPU AI PC - A Practical Guide

Many people explore local generative AI for privacy and to avoid token limits, but newer models require significant memory and compute—leading some to adopt multi-GPU setups.

NVIDIA blueprintFine-TuningRTXLLMGPU

nvidia

How to Fine-Tune an LLM on NVIDIA GPUs With Unsloth

Fine-tune popular AI models faster in Unsloth with NVIDIA RTX AI PCs, RTX PRO workstations, and DGX Spark—plus explore the new Nemotron Nano 3 family of open models.

NVIDIA blueprintLLMsOllamaRTXAnythingLLM

nvidia

How to Get Started With Large Language Models on NVIDIA RTX PCs

Learn about using LLMs locally on PCs and workstations with Ollama, AnythingLLM, and LM Studio.

NVIDIA blueprintGen AIComfyUILTX-2RTX

nvidia

How to Get Started With Visual Generative AI on NVIDIA RTX PCs

Learn how to run advanced image and video generation locally with ComfyUI and LTX-2 on RTX PCs.

NVIDIA blueprintStationImage GenerationComfyUIDocker

nvidia

Image & Video Generation with ComfyUI

Generate images and videos with FLUX, Wan 2.1, HunyuanVideo, and Cosmos on DGX Station

nvidia

Install and Use Isaac Sim and Isaac Lab

Build Isaac Sim and Isaac Lab from source for Spark

NVIDIA blueprintStationFine-TuningIsaac GR00TBlackwell

nvidia

Isaac GR00T N1.6 Fine-Tuning

Fine-tune and benchmark NVIDIA's GR00T N1.6 robotics foundation model on DGX Station

NVIDIA blueprintVision AIDGXVLMSpark

nvidia

Live VLM WebUI

Real-time Vision Language Model interaction with webcam streaming

nvidia

LLaMA Factory

Install and fine-tune models with LLaMA Factory

NVIDIA blueprintStationRadixAttentionStructured OutputBlackwell

nvidia

LLM Inference with SGLang

Serve LLMs with SGLang on DGX Station (Qwen3-8B default; Qwen3.6 MoE optional)—prefix-cached multi-turn, structured output, benchmarks, and inference-server guidance

NVIDIA blueprintInferencellmsterLM StudioLM Link

nvidia

LM Studio on DGX Spark

Deploy LM Studio and serve LLMs on a Spark device; use LM Link to access models remotely.

NVIDIA blueprintStationCodingOllamaClaude Code

nvidia

Local Coding Agent

Run local CLI coding agents with Ollama on DGX Station (NVIDIA GB300) using glm-4.7-flash (fast) or unsloth/GLM-4.7-GGUF:Q8_0 (best quality)

NVIDIA blueprintStationOpenFold3NemoClawNemotron

nvidia

Local Healthcare Agent on DGX Station

Run healthcare AI agents that analyze patient data and predict protein structures in an OpenShell sandbox on DGX Station

NVIDIA blueprintStationSystem ConfigurationDGX StationMIG

nvidia

MIG on DGX Station

Enable and configure Multi-Instance GPU (MIG) on DGX Station with GB300 Ultra (B300 GPUs)

nvidia

Multi-modal Inference

Setup multi-modal inference with TensorRT

nvidia

Nanochat on Dual-Spark

Setup Nanochat on Dual-Spark

NVIDIA blueprintTrainingnanochatPyTorchDGX Station

nvidia

Nanochat Training

Train a small ChatGPT-style LLM (nanochat) with tokenizer, pretraining, midtraining, and SFT on DGX Station with GB300 Ultra

nvidia

NCCL for Two Sparks

Install and test NCCL on two Sparks

NVIDIA blueprintNemotronInferenceLLMllama.cpp

nvidia

Nemotron-3-Nano with llama.cpp

Run Nemotron-3-Nano-30B model using llama.cpp on DGX Spark

nvidia

NIM on Spark

Deploy a NIM on Spark

NVIDIA blueprintTrainingNVFP4Megatron BridgeStation

nvidia

NVFP4 Pretraining with Megatron Bridge

Pretrain Llama 3.1 8B with NVFP4 mixed precision on DGX Station using Megatron Bridge

NVIDIA blueprintStationDGX

nvidia

NVFP4 Quantization

Quantize a model to NVFP4 to run on DGX Station using TensorRT Model Optimizer

nvidia

NVFP4 Quantization

Quantize a model to NVFP4 to run on Spark using TensorRT Model Optimizer

NVIDIA blueprintComfyUILTX-2RTX

nvidia

NVIDIA Video Generation Guide

Learn how to create videos using LTX-2 in ComfyUI, accelerated on RTX. Learn how to take control of visual generative AI, creating high resolution video on RTX.

nvidia

Open WebUI with Ollama

Install Open WebUI and use Ollama to chat with models on your Spark

NVIDIA blueprintDGXSparkLocal LLMAI Agent

nvidia

OpenClaw 🦞

Run OpenClaw locally on DGX Spark with a vLLM-served local model

nvidia

Optimized JAX

Optimize JAX to run on Spark

NVIDIA blueprintData ScienceRAPIDSFinancial Services

nvidia

Portfolio Optimization

GPU-Accelerated portfolio optimization using cuOpt and cuML

NVIDIA blueprintTrainingFine-TuningPerformance OptimizationKernel Development

nvidia

Profiler-Driven Kernel Optimization for Fine-Tuning

Use torch.profiler to find training bottlenecks, then write custom Triton kernels to optimize LLaMA 8B fine-tuning

nvidia

RAG Application in AI Workbench

Install and use AI Workbench to clone and run a reproducible RAG application

NVIDIA blueprintDGX SparkBrevSpark

nvidia

Register DGX Spark to Brev

Link your DGX Spark to Brev for remote access and shared environments

NVIDIA blueprintStationDGX StationBrev

nvidia

Register DGX Station to Brev

Link your DGX Station to Brev for remote access and sharing

NVIDIA blueprintNous ResearchLLMAI AgentSpark

nvidia

Run Hermes Agent with Local Models

Install and run the Hermes self-improving AI agent on DGX Spark.

NVIDIA blueprintDGX SparkInferenceLLMllama.cpp

nvidia

Run models with llama.cpp on DGX Spark

Build llama.cpp with CUDA and serve models via an OpenAI-compatible API

NVIDIA blueprintStationTelegramAgentic WorkflowNemoClaw

nvidia

Run NemoClaw with a Local LLM

Build your first local AI assistant on DGX Station using NemoClaw in a secure sandbox, with optional Telegram.

NVIDIA blueprintTelegramDGX SparkAgentic WorkflowNemoClaw

nvidia

Run NemoClaw with a Local LLM

Build your first local AI assistant on DGX Spark using NemoClaw and Ollama in a secure sandbox, with optional Telegram.

NVIDIA blueprintDGX SparkOpenClawRTX

nvidia

Run OpenClaw For Free On NVIDIA RTX GPUs & DGX Spark

Learn how to set up and host the popular AI agent using local inference apps optimized for RTX.

NVIDIA blueprintStationDGX StationOpenShellSecurity

nvidia

Secure Long Running AI Agents with OpenShell on DGX Station

Run OpenClaw with local models in an NVIDIA OpenShell sandbox on DGX Station

NVIDIA blueprintDGXOpenShellSparkSecurity

nvidia

Secure Long Running AI Agents with OpenShell on DGX Spark

Run OpenClaw with local models in an NVIDIA OpenShell sandbox on DGX Spark

nvidia

Set Up Local Network Access

NVIDIA Sync helps set up and configure SSH access

nvidia

Set up Tailscale on Your Spark

Use Tailscale to connect to your Spark on your home network no matter where you are

nvidia

SGLang for Inference

Install and use SGLang on DGX Spark

NVIDIA blueprintdata science

nvidia

Single-cell RNA Sequencing

An end-to-end GPU-powered workflow for scRNA-seq using RAPIDS

nvidia

Spark & Reachy Photo Booth

AI augmented photo booth using the DGX Spark and Reachy Mini.

nvidia

Spark & Reachy Photo Booth

AI augmented photo booth using the DGX Spark and Reachy Mini.

nvidia

Speculative Decoding

Learn how to set up speculative decoding for fast inference on Spark

NVIDIA blueprintGraphRAGKnowledge GraphsNLPDGX

nvidia

Text to Knowledge Graph

Transform unstructured text into interactive knowledge graphs with LLM inference and graph visualization

NVIDIA blueprintGraphRAGKnowledge GraphsNLPOllama

nvidia

Text to Knowledge Graph on DGX Station

Transform unstructured text into interactive knowledge graphs with LLM inference and graph visualization

NVIDIA blueprintData ScienceNLPBERTopicMachine Learning

nvidia

Topic Modeling

Extract insights from massive text datasets using cuML's GPU-accelerated BERTopic

nvidia

TRT LLM for Inference

Install and use TensorRT-LLM on DGX Spark

nvidia

Unsloth on DGX Spark

Optimized fine-tuning with Unsloth

NVIDIA blueprintDGXVibeCodingSpark

nvidia

Vibe Coding in VS Code

Use DGX Spark as a local or remote Vibe Coding assistant with Ollama and Continue

NVIDIA blueprintDGXImage UnderstandingVision-Language ModelsGRPO

nvidia

Vision-Language Model Fine-tuning

Fine-tune Vision-Language Models for image and video understanding tasks using Qwen2.5-VL and InternVL3

nvidia

vLLM for Inference

Install and use vLLM on DGX Spark

NVIDIA blueprintStationvLLMInference

nvidia

vLLM for Inference

Install and use vLLM on DGX Station

NVIDIA blueprintvLLMInferenceRTX

nvidia

vLLM for Inference

Install and use vLLM on NVIDIA RTX Pro 6000

nvidia

VS Code

Install and use VS Code locally or remotely