Explanations
The main thing this neuron does is find mentions of territories being annexed
terms related to territorial acquisition or annexation
Negative Logits
Token     Logit
pid       -0.73
orah      -0.71
pton      -0.70
acter     -0.69
ukemia    -0.67
DAY       -0.64
$$        -0.59
xious     -0.59
fres      -0.58
life      -0.57
Positive Logits
Token         Logit
annexed       1.13
annexation    1.06
annex         0.88
itized        0.85
ãĥ¼ãĥĨãĤ£     0.84
ificant       0.84
imates        0.84
éĹĺ           0.80
ãģĨ           0.78
hers          0.77
Activations Density 0.013%
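
Three of the positive-logit tokens (ãĥ¼ãĥĨãĤ£, éĹĺ, ãģĨ) look like byte-level BPE vocabulary entries shown in their raw form rather than as readable text. The sketch below assumes the vocabulary uses the GPT-2-style byte-to-unicode mapping; this page does not say which tokenizer is used, so treat that as an assumption. The helper names `bytes_to_unicode` and `decode_token` are illustrative. Under that assumption, the code reverses the mapping to recover the text each token stands for.

```python
def bytes_to_unicode():
    """Build the GPT-2-style map from byte values (0-255) to display characters.

    Assumption: the token strings on this page were rendered with this mapping.
    """
    # Printable bytes are displayed as themselves.
    bs = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(0xA1, 0xAC + 1))
        + list(range(0xAE, 0xFF + 1))
    )
    cs = bs[:]
    n = 0
    # Every remaining byte is shifted to an unused code point at 256 + n.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return {b: chr(c) for b, c in zip(bs, cs)}


# Inverse map: display character -> original byte value.
UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token: str) -> str:
    """Turn a raw vocabulary string back into the UTF-8 text it encodes."""
    raw = bytes(UNICODE_TO_BYTE[ch] for ch in token)
    # Some tokens hold only part of a multi-byte character, so do not fail on them.
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for tok in ["annexed", "ãĥ¼ãĥĨãĤ£", "éĹĺ", "ãģĨ"]:
        print(repr(tok), "->", repr(decode_token(tok)))
```

Plain ASCII tokens such as `annexed` pass through unchanged. If the mapping assumption holds, the three raw entries decode to Japanese or CJK text, which suggests they are unrelated to the annexation theme in the explanations.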