INDEX
Explanations
occurrences of the letter 'K' in varying contexts
New Auto-Interp
Negative Logits
anye
-0.23
aren
-0.22
ullan
-0.19
à¤Ī
-0.18
iller
-0.18
odi
-0.18
ingt
-0.17
ills
-0.17
ens
-0.17
ỳ
-0.16
POSITIVE LOGITS
esting
0.19
lags
0.19
inated
0.19
ucher
0.19
uo
0.17
orp
0.17
oster
0.16
loo
0.16
rick
0.16
edia
0.16
Activations Density 0.031%