INDEX
Explanations
terms related to wartime experiences and military actions
New Auto-Interp
Negative Logits
Garrison
-0.17
ksi
-0.15
arbon
-0.15
Ally
-0.14
armed
-0.14
uzzle
-0.14
defense
-0.13
edd
-0.13
uni
-0.13
intrigue
-0.13
POSITIVE LOGITS
kad
0.15
createState
0.14
lej
0.14
Ñĩа
0.14
å¿Ĺ
0.14
stav
0.13
idan
0.13
priv
0.13
Consolid
0.13
deb
0.13
Activations Density 0.189%