INDEX
Explanations
Explanation of neuron 4 behavior: this neuron mainly activates on the phrase “and/or” in license or documentation boilerplate.
Negative Logits
SubMenu
-0.07
"<<
-0.07
podnik
-0.06
judul
-0.06
otras
-0.06
jiné
-0.06
蜜
-0.06
นาง
-0.06
método
-0.06
otros
-0.06
Positive Logits
°
0.06
ourney
0.06
pakistan
0.06
ırı
0.06
Lithuania
0.06
vraiment
0.06
kı
0.06
ucked
0.06
/
0.06
disproportion
0.06
Activations Density 0.001%