INDEX
Explanations
references to military ranks and figures
mentions of military ranks, specifically "Colonel."
New Auto-Interp
Negative Logits
Pengu
-0.75
sett
-0.73
lins
-0.72
irlf
-0.70
itar
-0.69
thora
-0.69
elf
-0.68
robat
-0.66
Chaser
-0.64
pun
-0.64
POSITIVE LOGITS
Gaddafi
0.86
LECT
0.85
onel
0.82
terday
0.75
ĸļ
0.75
xual
0.74
agne
0.72
ktop
0.70
ength
0.69
trl
0.68
Activations Density 0.045%