Blueprint by nvidia
Run models with llama.cpp on DGX Spark
Build llama.cpp with CUDA and serve models via an OpenAI-compatible API
NVIDIA blueprint, DGX Spark, Inference, LLM, llama.cpp, Spark