INDEX
Explanations
Henchmen/thugs
The neuron is looking for mentions of armed security or protection personnel (e.g., guards, police).
New Auto-Interp
Negative Logits
807
-0.07
вим
-0.06
lan
-0.06
anni
-0.06
Также
-0.06
다른
-0.06
مردم
-0.06
levance
-0.06
914
-0.06
bladder
-0.06
POSITIVE LOGITS
enqu
0.07
rout
0.07
denial
0.06
ندا
0.06
.wr
0.06
uled
0.06
keit
0.06
ORIA
0.06
Http
0.06
versa
0.06
Activations Density 0.032%