INDEX
Explanations
instances of specific names and associated identifiers
Negative Logits
cejas             -0.77
rantai            -0.75
almohada          -0.74
sorpresas         -0.73
ladrillo          -0.71
desmotivaciones   -0.70
manguera          -0.70
agujas            -0.69
rajut             -0.69
botellas          -0.68
Positive Logits
Y                 0.79
N                 0.78
H                 0.78
D                 0.77
V                 0.77
Z                 0.76
G                 0.76
B                 0.75
K                 0.75
M                 0.75
Activations Density: 0.769%