INDEX
Explanations
references to spokespersons or official representatives in the text
New Auto-Interp
Negative Logits
inois
-0.17
ponge
-0.16
presso
-0.15
UFFIX
-0.14
azaar
-0.14
esser
-0.14
ÑĤон
-0.14
nebu
-0.14
-archive
-0.13
âb
-0.13
POSITIVE LOGITS
684
0.16
Bundy
0.15
/commons
0.14
enie
0.14
Cheryl
0.14
IDO
0.14
Paren
0.14
892
0.14
seudo
0.14
701
0.14
Activations Density 0.008%