INDEX
Explanations
The neuron detects terms related to official investigations or law‐enforcement authorities (e.g. investigation, police, authorities).
New Auto-Interp
Negative Logits
.quest
-0.07
Farms
-0.07
111
-0.06
alloy
-0.06
рю
-0.06
Cómo
-0.06
نظام
-0.06
aramel
-0.06
familiarity
-0.06
("/")↵-0.06
POSITIVE LOGITS
погод
0.07
تبه
0.07
pose
0.07
MOT
0.06
initialization
0.06
sax
0.06
881
0.06
:');↵
0.06
socket
0.06
]++;↵
0.06
Activations Density 0.011%