INDEX
Explanations
frequent occurrences of a specific letter or character
New Auto-Interp
Negative Logits
azi
-0.18
afe
-0.18
chter
-0.17
alt
-0.17
rique
-0.16
поÑĩ
-0.16
croll
-0.16
ales
-0.16
ubl
-0.16
altung
-0.15
POSITIVE LOGITS
ver
0.20
gren
0.18
est
0.17
lyph
0.16
ret
0.16
ez
0.15
overs
0.15
Chew
0.15
detail
0.15
estre
0.14
Activations Density 0.012%