INDEX
Explanations
mentions of military-related entities, leaders, and activities
names and organizations related to military and political topics
Negative Logits
THEN       -0.77
_______    -0.75
attRot     -0.73
Anyway     -0.71
ĺħ         -0.67
then       -0.67
rame       -0.66
wikipedia  -0.65
______     -0.64
brainer    -0.62
Positive Logits
continues  1.34
has        1.22
remains    1.22
hasn       1.16
retains    1.16
refuses    1.13
maintains  1.12
enjoys     1.11
dominates  1.10
spends     1.06
Activations Density: 0.414%