INDEX
Explanations
words related to warfare
references to the concept of "war"
New Auto-Interp
Negative Logits
essee
-0.95
sembly
-0.90
Ħ¢
-0.84
aminer
-0.82
ħĭ
-0.82
İĭ
-0.81
etsk
-0.75
aunder
-0.72
hiba
-0.70
issance
-0.69
POSITIVE LOGITS
rior
1.29
fare
1.27
lords
1.22
riors
1.21
lord
1.17
locks
1.03
ped
0.94
ring
0.94
ping
0.92
bands
0.88
Activations Density 0.029%