Catalog

Model by microsoft

phi-4-multimodal-instruct

Cutting-edge open multimodal model exceling in high-quality reasoning from image and audio inputs.

NVIDIA modelSpeech RecognitionVisual QALanguage GenerationChart and Table UnderstandingchatImage-to-Text