INDEX
Explanations
Italian and Spanish words and names
New Auto-Interp
Negative Logits
etheless
-0.77
INGTON
-0.76
interrupted
-0.70
oola
-0.69
iaries
-0.69
orage
-0.67
hips
-0.67
ourcing
-0.64
otos
-0.64
abies
-0.64
POSITIVE LOGITS
lla
1.51
ller
1.42
llers
1.42
lli
1.39
lda
1.37
gger
1.31
zza
1.31
xt
1.31
lling
1.31
ppo
1.26
Activations Density 4.423%