INDEX
Explanations
names of public figures and officials
New Auto-Interp
Negative Logits
\OptionsResolver
-0.15
avl
-0.15
avana
-0.15
avor
-0.15
unos
-0.14
andy
-0.14
aval
-0.14
AttributeSet
-0.14
avia
-0.14
åĿĽ
-0.14
POSITIVE LOGITS
iser
0.18
igo
0.16
uts
0.16
ersh
0.15
erset
0.15
itsu
0.15
ĥ
0.14
iam
0.14
enta
0.14
588
0.14
Activations Density 0.007%