Explanations
names with a focus on the individual "Dani" and "Paula."
Negative Logits
ités          -0.15
engo          -0.15
asher         -0.15
Airways       -0.14
asco          -0.14
iece          -0.14
udo           -0.14
Controlled    -0.14
etimes        -0.14
éŁ¿           -0.14
Positive Logits
ela           0.19
nel           0.16
elsen         0.16
utable        0.15
.jackson      0.14
วล            0.14
PK            0.14
ç«ĭãģ¦        0.14
èle           0.14
plete         0.14
Activations Density: 0.021%