INDEX
Explanations
specific characters or symbols that may indicate formatting in text
New Auto-Interp
Negative Logits
desk
-0.16
752
-0.16
Immigration
-0.15
asp
-0.15
ÂŃs
-0.15
ÃĹ↵↵
-0.15
ilion
-0.15
ody
-0.15
742
-0.14
iesen
-0.14
POSITIVE LOGITS
»
0.33
¿
0.32
¼
0.28
¾
0.26
½
0.23
½Ķ
0.21
Bain
0.18
alom
0.17
Ê
0.17
mrb
0.16
Activations Density 0.004%