Skill by nvidia

tao-generate-image-grounding

Two-step image grounding pipeline: extracts referring expressions from (image, caption) pairs and grounds them to pixel-space bounding boxes via a VLM. Use when the user wants to ground captions to bboxes, generate phrase-grounded annotations, auto-label

NVIDIA skillDeveloperAI EngineerData ScientistMl EngineerAI And Machine LearningTAO Toolkit

Open dashboard NVIDIA source