Cost-efficient open-source MoE model rivaling GPT-4o in reasoning and math tasks
DeepSeek-V3 is a 671-billion-parameter Mixture-of-Experts (MoE) model that activates 37B parameters per token. It excels in coding, mathematics, and multilingual tasks, outperforming leading open-source models such as Qwen2.5-72B and Llama-3.1-405B and matching closed-source models such as GPT-4o and Claude-3.5-Sonnet on standard benchmarks. Trained on 14.8 trillion tokens with FP8 mixed precision, it achieves state-of-the-art training efficiency, supports a 128K context window, and generates roughly 3x faster than its predecessor.
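The "37B of 671B parameters activated per token" figure follows from MoE routing: a gating network sends each token to only a few experts, so most of the model's weights sit idle for any given token. The sketch below is a minimal, generic top-k gated MoE layer for illustration only; the layer sizes, expert count, and top-k value are placeholder assumptions, not DeepSeek-V3's actual configuration or routing code.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class TopKMoELayer(nn.Module):
    """Illustrative top-k gated MoE layer: each token is routed to only
    k experts, so only a fraction of the layer's parameters are active
    per token. Sizes here are placeholders, not DeepSeek-V3's."""

    def __init__(self, d_model=512, d_ff=2048, num_experts=8, top_k=2):
        super().__init__()
        self.top_k = top_k
        self.gate = nn.Linear(d_model, num_experts, bias=False)  # router
        self.experts = nn.ModuleList([
            nn.Sequential(nn.Linear(d_model, d_ff), nn.GELU(), nn.Linear(d_ff, d_model))
            for _ in range(num_experts)
        ])

    def forward(self, x):  # x: (num_tokens, d_model)
        scores = F.softmax(self.gate(x), dim=-1)            # routing probabilities
        weights, idx = scores.topk(self.top_k, dim=-1)      # top-k experts per token
        weights = weights / weights.sum(dim=-1, keepdim=True)  # renormalize gate weights
        out = torch.zeros_like(x)
        for slot in range(self.top_k):
            for e, expert in enumerate(self.experts):
                mask = idx[:, slot] == e                    # tokens routed to expert e
                if mask.any():
                    out[mask] += weights[mask, slot].unsqueeze(-1) * expert(x[mask])
        return out

tokens = torch.randn(4, 512)
layer = TopKMoELayer()
print(layer(tokens).shape)  # torch.Size([4, 512])
```

With top_k=2 of 8 experts, each token touches only a quarter of the expert parameters per layer, which is how a very large total parameter count can coexist with a much smaller active parameter count and cost per token.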