DApp Store | Pusat Web3 untuk Event & Game

Topik trending

DeepSeek launches V3.1, unifying V3 and R1 into a hybrid reasoning model with an incremental increase in intelligence Incremental intelligence increase: Initial benchmarking results for DeepSeek V3.1 show Artificial Analysis Intelligence Index of 60 in reasoning mode, up from the R1’s score of 59. In non-reasoning mode, V3.1 achieves a score of 49, a greater increase from the earlier V3 0324 score of 44. This leaves V3.1 (reasoning) behind Alibaba’s latest Qwen3 235B 2507 (reasoning) - DeepSeek has not taken back the lead. Hybrid reasoning: @deepseek_ai has moved to a hybrid reasoning model for the first time - supporting both reasoning and non-reasoning modes. DeepSeek’s move to a unified hybrid reasoning model mimics the approach taken by OpenAI, Anthropic and Google. It is interesting to note, however, that Alibaba recently abandoned their the hybrid approach they favored for Qwen3 with their separate releases of Qwen3 2507 reasoning and instruct models. Function calling / tool use: While DeepSeek claims improved function calling for the model, DeepSeek V3.1 does not support function calling when in reasoning mode. This is likely to substantially limit its ability to support agentic workflows with intelligence requirements, including in coding agents. Token usage: DeepSeek V3.1 scores incrementally higher in reasoning mode than DeepSeek R1, and uses slightly fewer tokens across the evals we use for Artificial Analysis Intelligence Index. In non-reasoning mode, it uses slightly more tokens than V3 0324 - but still several times fewer than in its own reasoning mode. API: DeepSeek’s first party API now serves the new DeepSeek V3.1 model on both their chat and reasoning endpoints - simply changing whether the end thinking </think> token is provided to the model in the chat template to control whether the model will reason. Architecture: DeepSeek V3.1 is architecturally identical to prior V3 and R1 models, with 671B total parameters and 37B active parameters. Implications: We would advise caution in making any assumptions about what this release implies about DeepSeek’s progress toward a future model referred to in rumors as V4 or R2. We note that DeepSeek previously released the final model built on their V2 architecture on December 10 2024, just two weeks before releasing V3.

Teratas

Peringkat

Favorit