INDEX
Explanations
phrases related to weapons and military actions
references to historical events or conditions related to conflict and its consequences
New Auto-Interp
Negative Logits
eleph
-1.15
pione
-0.99
exting
-0.96
conduc
-0.96
streng
-0.94
conclud
-0.93
destro
-0.90
tremend
-0.89
ortunately
-0.86
unnecess
-0.86
POSITIVE LOGITS
under
1.05
appropriate
0.96
inar
0.92
unders
0.91
emphasis
0.90
lore
0.90
urban
0.90
unc
0.89
lance
0.86
apest
0.85
Activations Density 0.141%