INDEX
Explanations
Italian feminine articles and prepositions
New Auto-Interp
Negative Logits
quina
-0.14
rap
-0.14
ista
-0.14
ppelin
-0.14
anche
-0.14
Bark
-0.13
dana
-0.13
ilin
-0.13
tipping
-0.13
hall
-0.13
POSITIVE LOGITS
ayah
0.17
ekt
0.17
commend
0.16
andel
0.16
CTR
0.15
ngen
0.15
instanc
0.14
ught
0.14
ingular
0.14
IDX
0.14
Activations Density 0.014%