INDEX
Explanations
instances of the conjunction "but" and punctuation
New Auto-Interp
Negative Logits
alth
-0.17
UBE
-0.15
olumbia
-0.15
ei
-0.15
Columbia
-0.14
WO
-0.14
illaume
-0.14
foot
-0.14
Flor
-0.14
анÑģи
-0.14
POSITIVE LOGITS
(~(
0.14
ibri
0.14
pell
0.14
ÙĨÚ¯
0.14
ÃĹ↵↵
0.13
elsen
0.13
apel
0.13
_consts
0.13
ảo
0.13
919
0.13
Activations Density 0.150%