Catalog
Model by microsoft
phi-4-multimodal-instruct
Cutting-edge open multimodal model exceling in high-quality reasoning from image and audio inputs.
NVIDIA modelSpeech RecognitionVisual QALanguage GenerationChart and Table UnderstandingchatImage-to-Text