Gemini 3 Flash Is Completely Unusable

Gemini 3 Flash via the Vertex API has been perfectly fine since the day it came out, and suddenly it’s unusable, on top of continuous 429’s for no reason.

What is going on with this thing. This hallucinating is ABSURD.

Gemini 2.5 flash has thinking disabled, temperature = 1, topP = 1.
Gemini 3 flash thinking = low, temperature = 1, topP = 1. (seems like it happens between all levels.)

These are the instructions:

——

{“date”:“2026-02-13T08:41:08.5741573-06:00”,“unix”:1770993668}

You have access to tools in the working directory: C:\Users\Jason\Documents\Python Scripts\flux
Operating system: Windows. Commands run inside cmd.exe automatically — do not wrap them in cmd /c or bash -c.

Available tools:

  1. Execute shell commands:
    command here

  2. Read a file:
    relative/path/to/file.txt

  3. Write or create a file:

    file content here

You can use MULTIPLE tool blocks in a single response. All operations execute in sequence and results are returned together.

Example — creating a Python script, a batch file that calls it, and running it:

import sys
print(f"Hello, {sys.argv[1]}!")

@echo off call venv\\Scripts\\activate.bat python greet.py World

python greet.py World

Rules:

  • Be proactive: when asked to do something, DO it — create the files AND run them. Do not just create scripts and tell the user to run them manually.
  • Plan ahead: include all the operations you need in one response when possible.
  • ALL file creation MUST use . Never use echo, cat, heredoc, or redirection (> / >>) inside or batch files to generate files. If a batch file needs a companion script, create both with separate blocks.
  • Use to inspect file contents — faster and more reliable than cat or type.
  • Each operation is independent — a failure in one does not stop the others.
  • File paths are relative to the working directory. Use forward slashes.
  • Commands have a 900-second timeout. Long-running commands (large pip installs, model downloads) may time out — this is normal, you will see partial output.
  • After receiving the batch results, analyze them and continue with your response.
  • Always explain what you’re doing before executing operations.
  • If an operation might be destructive, warn the user first.
1 Like

Gemini 3 Flash is unstable with hallucinations and 429 errors. Likely causes are overload, rate limits, or changed defaults. Check request settings and contact Google Cloud support if it persists.