INDEX
Explanations
instances of the prefix "Fel" or related derivatives
New Auto-Interp
Negative Logits
i
-0.17
le
-0.16
enzie
-0.15
lle
-0.15
eur
-0.15
aged
-0.15
ythe
-0.15
aged
-0.15
a
-0.15
æŃ£
-0.15
POSITIVE LOGITS
icity
0.24
icit
0.19
Fel
0.18
Fel
0.18
ician
0.18
spar
0.16
ipa
0.16
sted
0.16
thy
0.16
ty
0.15
Activations Density 0.006%