INDEX
Explanations
proper nouns related to individuals
names associated with individuals or entities in a context of inquiry or questioning
Negative Logits
harness   -0.67
comp      -0.67
MM        -0.66
author    -0.64
bands     -0.63
GB        -0.61
toy       -0.60
CR        -0.60
Dru       -0.60
Bod       -0.59
Positive Logits
osta      4.90
chio      1.37
zona      1.09
kson      1.08
uador     1.05
acia      1.04
endi      1.00
raltar    0.99
oola      0.98
ost       0.97
Activations Density 0.019%
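
To make the density figure concrete, here is a minimal sketch that assumes "activations density" means the fraction of token positions on which the feature has a nonzero activation. The page does not state that definition. The function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and exist only for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of token positions whose activation exceeds `threshold`.

    Assumption: density counts positions where the feature fires at all;
    the dashboard does not confirm this definition.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 100,000 token positions, 19 of them active.
sample = [0.0] * 99_981 + [1.5] * 19
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.019%
```

Under this reading, a density of 0.019% would correspond to roughly 19 active positions per 100,000 tokens, which marks the feature as sparse.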