INDEX
Explanations
phrases starting with the symbol "âĢ" and subsequent words
occurrences of specific symbols or characters in the text
New Auto-Interp
Negative Logits
hemor
-0.66
glac
-0.65
shroud
-0.64
transported
-0.63
board
-0.63
Borough
-0.62
range
-0.61
semblance
-0.60
Manhattan
-0.60
Dresden
-0.60
POSITIVE LOGITS
ª
1.33
¹
1.30
³
1.22
ı
1.22
ł
1.21
¦
1.15
«
1.15
¾
1.11
¸
1.11
©
1.10
Activations Density 0.075%