Explanations
names or titles associated with success in various fields
Negative Logits
iev      -0.17
uros     -0.16
pie      -0.16
oci      -0.16
LEC      -0.15
ahas     -0.14
stell    -0.14
stan     -0.14
nicos    -0.14
iele     -0.14
Positive Logits
alty      0.25
alties    0.21
enne      0.18
eur       0.17
rides     0.17
ce        0.17
ÚĺÙĩ      0.17
sson      0.16
ne        0.16
otes      0.15
Activations Density 0.022%