INDEX
Explanations
The neuron detects mentions of the word “banana” (including its plural and derivatives).
New Auto-Interp
Negative Logits
UserRole
-0.08
�
-0.07
760
-0.07
第四
-0.07
*$
-0.07
Pep
-0.07
Cadastro
-0.06
785
-0.06
4
-0.06
四
-0.06
POSITIVE LOGITS
bananas
0.12
banana
0.11
Banana
0.10
ana
0.07
line
0.07
ange
0.07
à
0.07
andas
0.07
antages
0.06
(LayoutInflater
0.06
Activations Density 0.003%