INDEX
Explanations
mentions of wars and military references
references to various types of wars
New Auto-Interp
Negative Logits
ĸļ
-0.93
sembly
-0.76
essee
-0.71
Ħ¢
-0.70
İĭ
-0.69
tremend
-0.69
URES
-0.68
aminer
-0.66
kindred
-0.66
extrad
-0.64
POSITIVE LOGITS
riors
1.40
rior
1.31
fare
1.31
war
1.24
ring
1.13
lords
0.92
fighter
0.92
lord
0.90
bucks
0.90
far
0.87
Activations Density 0.006%