Explanations
the presence of specific words or phrases related to authority and governance
Negative Logits
Token      Logit
icast      -0.16
rosse      -0.16
elix       -0.15
YSTEM      -0.15
ells       -0.15
hus        -0.15
lix        -0.15
ICAST      -0.15
Ñĥжд       -0.15
ostream    -0.14
Positive Logits
Token        Logit
Ham          0.19
ham          0.18
craft        0.17
Craft        0.17
Craft        0.17
HAM          0.16
Southampton  0.16
Ham          0.16
craft        0.15
Gre          0.15
Activations Density: 0.034%