INDEX
Explanations
punctuation and common auxiliary verbs in the text
Negative Logits
ampa        -0.15
_simps      -0.15
ÂŃi         -0.14
ÚĨØ´Ùħ      -0.14
loid        -0.14
080         -0.14
stadt       -0.14
Pearce      -0.13
uler        -0.13
ÙĦÙĥرة      -0.13
Positive Logits
Cove        0.15
circles     0.15
uncture     0.14
isseur      0.14
aten        0.14
üf          0.14
itant       0.14
avia        0.13
Fallback    0.13
βά          0.13
Activations Density: 0.513%