DeepSeek documents
DeepSeek V3 new
How does DeepSeek R1's mixture of experts (MoE) architecture enhance its performance?
DeepSeek-R1 model specifications