← 返回模型库
M
MiMo-V2-Omni
xiaomi
MiMo-V2-Omni is a frontier omni-modal model that natively processes image, video, and audio inputs within a unified architecture. It combines strong multimodal perception with agentic capability - visual grounding, multi-step...
官方定价 输入
$0.1/M Tokens
官方定价 输出
$0.5/M Tokens
LMSYS Elo
-