The main thing this neuron does is find content policy violations and the AI's refusal to generate harmful, illegal, or unethical content, along with explanations of its safety principles.
gemini-2.5-flash
content that includes potentially harmful, explicit, or disturbing themes