INDEX
Explanations
This neuron detects mentions and descriptions of flags and flag-related actions (e.g., flag, raised, draped, flown).
Negative Logits
jobs          -0.07
advice        -0.06
Bard          -0.06
veh           -0.06
care          -0.06
fees          -0.06
Hizmet        -0.06
Mehmet        -0.06
(scan         -0.06
separately    -0.06
Positive Logits
flag          0.08
افة           0.08
anthem        0.08
Flag          0.08
λία           0.07
Flag          0.07
LError        0.07
ні            0.06
_aspect       0.06
Abb           0.06
Activations Density 0.007%