INDEX
Explanations
occurrences of the word "El" along with its variations
New Auto-Interp
Negative Logits
ynet
-0.16
o
-0.16
upertino
-0.15
ole
-0.15
l
-0.15
ously
-0.14
lum
-0.14
yp
-0.14
yper
-0.14
dit
-0.14
POSITIVE LOGITS
ora
0.20
odie
0.18
raith
0.18
kins
0.17
bow
0.16
ipse
0.16
ÃŃas
0.16
placeholders
0.15
Paso
0.15
izabeth
0.15
Activations Density 0.015%