INDEX
Explanations
occurrences of the letter 'h'
Negative Logits
orous       -0.15
arch        -0.15
hai         -0.14
unt         -0.14
_variant    -0.14
rych        -0.14
è¦Ĩ         -0.13
esty        -0.13
Ïģε         -0.13
oly         -0.13
Positive Logits
ufig        0.16
afen        0.16
oft         0.16
eden        0.16
tte         0.15
raj         0.15
of          0.15
OF          0.15
trand       0.14
ogg         0.14
Activations Density 0.027%