Efficient, open-source AI model rivaling larger competitors with lower resource requirements.
Mistral Small 3 is a 24B-parameter language model released under the Apache 2.0 license. It offers performance comparable to larger models like Llama 3.3 70B while being more than 3x faster on the same hardware. Designed for local deployment, it excels in tasks requiring robust language and instruction following with very low latency. The model can be quantized to run on a single RTX 4090 or a Macbook with 32GB RAM