INDEX
Explanations
occurrences of specific symbols or characters, potentially indicating formatting or encoding elements
New Auto-Interp
Negative Logits
ÂĹ
-0.18
connexion
-0.16
specialised
-0.15
,—
-0.15
'.
-0.15
/↵
-0.15
(«
-0.15
ÃIJ
-0.14
organisers
-0.14
-↵↵
-0.14
POSITIVE LOGITS
Marcus
0.25
Young
0.23
Marcus
0.20
–
0.20
Young
0.16
upstream
0.15
young
0.15
today
0.15
documented
0.15
DNA
0.14
Activations Density 0.003%