INDEX
Explanations
various forms of punctuation and non-word characters in the text
New Auto-Interp
Negative Logits
pie
-0.16
æĭ©
-0.15
Insecta
-0.15
↵↵
-0.15
ên
-0.14
alers
-0.14
erp
-0.14
PIO
-0.14
Rudd
-0.14
ideographic
-0.14
POSITIVE LOGITS
à¥ģà¤
0.16
ska
0.16
sian
0.15
735
0.15
524
0.14
rices
0.14
bmi
0.14
ÑĢоÑģ
0.14
Schultz
0.14
ersen
0.13
Activations Density 0.029%