Updated 2026-03-22 07:34:43 +00:00
Multistral is a flexible multimodal small language model that seamlessly combines text, vision, and audio processing capabilities. Built on top of proven architectures including Ministral-3 (text), Pixtral (vision), and Voxtral (audio), Multistral provides, at this time, only a 4B text variant (5B multimodal).
Updated 2026-03-09 08:17:50 +00:00
Updated 2026-03-04 06:35:44 +00:00
Updated 2026-02-13 18:08:43 +00:00
Updated 2026-02-05 17:00:20 +00:00
Updated 2026-02-05 16:49:29 +00:00
Updated 2026-02-05 16:37:49 +00:00
Updated 2026-02-05 16:32:49 +00:00
Updated 2026-01-06 04:01:06 +00:00
Updated 2025-12-15 12:24:39 +00:00