Explanations
terms related to warfare, such as 'warfare', 'deterrence', and 'weaponry'
references to warfare and military strategies
Negative Logits
Token     Logit
icles     -0.78
gow       -0.76
umen      -0.74
este      -0.73
abet      -0.73
otto      -0.70
angelo    -0.69
val       -0.69
ocent     -0.68
ergy      -0.68
Positive Logits
Token          Logit
fare           1.07
rior           0.89
Warfare        0.88
riors          0.85
warfare        0.80
hysteria       0.75
waged          0.74
Royale         0.73
simulation     0.72
battlefield    0.71
Activations Density: 0.021%