Unexpected Results Encountered When Using the DeepSeek-V3.1 API on Vertex AI

We are using Vertex AI’s DeepSeek-V3.1 API service. Recently, we have found that when the prompt exceeds a certain length (possibly around 30k tokens), the API returns an empty response with no valid results, even though the HTTP status code is 200.

Hi @Zhouze ,

Thank you for sharing this!

I will reach out to the Vertex AI Inference team to collect feedback on it.

I'll keep you posted.

Best


The DeepSeek-V3.1 API on Vertex AI stops returning results when prompts exceed roughly 30k tokens, even though the HTTP status shows 200. To avoid this, keep prompts under the token limit or split longer inputs into smaller chunks. The Vertex AI team is aware and reviewing this behavior.
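Until the fix landed, the workaround above meant staying under the limit client-side. As a minimal sketch of that idea, the helper below estimates the token count with a rough characters-per-token heuristic (roughly 4 characters per token for English text; this ratio, the function names, and the 30k default are my assumptions, not part of the Vertex AI API) and splits an oversized prompt at paragraph boundaries:

```python
def estimate_tokens(text: str) -> int:
    # Rough heuristic: ~4 characters per token for English text.
    # Replace with a real tokenizer count when one is available.
    return max(1, len(text) // 4)

def split_prompt(text: str, max_tokens: int = 30_000) -> list[str]:
    """Split text into chunks whose estimated token count stays under max_tokens.

    Chunks are built from whole paragraphs; a single paragraph longer than
    the budget will still come through oversized, so long paragraphs would
    need a finer-grained split.
    """
    max_chars = max_tokens * 4  # mirror the heuristic in estimate_tokens
    chunks: list[str] = []
    current = ""
    for para in text.split("\n\n"):
        candidate = f"{current}\n\n{para}" if current else para
        if len(candidate) <= max_chars:
            current = candidate
        else:
            if current:
                chunks.append(current)
            current = para
    if current:
        chunks.append(current)
    return chunks
```

Each chunk can then be sent as a separate request, with the responses stitched back together. This is only a sketch of the "split longer inputs into smaller chunks" advice, not an official mitigation from the Vertex AI team.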

Hi @Zhouze ,

The issue has been resolved. Please let me know if it works.

Best

Hi @ilnardo92 ,
Thank you for the prompt fix. I have tested the API with an extended prompt (48,417 tokens), and it is now working perfectly. The responses are complete and as expected.

We really appreciate the swift resolution.

For our internal understanding and to help prevent similar issues in the future, would it be possible to briefly share what the root cause was? Any insight would be helpful.

Thanks again for your support.

Best regards,
Zhouze

This topic was automatically closed 7 days after the last reply. New replies are no longer allowed.