Updated 2026-04-22 08:42:53 +00:00
Updated 2026-04-13 19:29:28 +00:00
Updated 2026-04-13 19:24:55 +00:00
Updated 2026-04-02 10:01:09 +00:00
Multistral is a flexible multimodal small language model that seamlessly combines text, vision, and audio processing capabilities. Built on top of proven architectures including Ministral-3 (text), Pixtral (vision), and Voxtral (audio), Multistral provides, at this time, only a 4B text variant (5B multimodal).
Updated 2026-03-09 08:17:50 +00:00
Updated 2026-03-04 06:35:44 +00:00
Updated 2026-01-06 04:01:06 +00:00
Updated 2025-12-15 12:24:39 +00:00