INDEX
Explanations
references to military invasions or occupations
New Auto-Interp
Negative Logits
peer
-0.76
uctor
-0.73
spot
-0.70
mpeg
-0.67
MpServer
-0.66
ube
-0.65
ruff
-0.64
etooth
-0.64
Scient
-0.64
trust
-0.64
POSITIVE LOGITS
invasion
0.81
Yamato
0.79
armies
0.79
force
0.77
naires
0.76
warfare
0.74
Invasion
0.74
forces
0.73
lord
0.69
Barbarian
0.69
Activations Density 0.050%