INDEX
Explanations
mentions of war-related terms or concepts
references to "war" in various contexts
Negative Logits
FORMATION   -0.88
Ħ¢          -0.83
essee       -0.82
aminer      -0.74
sembly      -0.74
ostics      -0.73
AUT         -0.70
Asset       -0.70
prints      -0.69
Distance    -0.68
Positive Logits
fare        1.21
lords       1.14
riors       1.10
lord        1.08
rior        1.07
waged       0.96
fighting    0.96
ring        0.94
fighter     0.90
fighters    0.82
Activations Density: 0.030%