Interesting, so a more capable new model is thinking about prompt-injecting the previous-generation LLM reviewer to pass the test. What could possibly go wrong? 🤔 From the Gemini 3 safety report:
Obviously not a problem with current model capabilities, but if things like this keep happening in the future, we could get some nasty surprises.