INDEX
Explanations
punctuation and markers that indicate speech or quotes
New Auto-Interp
Negative Logits
âĢŀ
-0.23
“â̦
-0.23
(“
-0.23
`),↵
-0.17
okt
-0.16
******************************************************************************↵
-0.16
“
-0.16
htub
-0.15
(«
-0.15
sonian
-0.15
POSITIVE LOGITS
,"
0.21
":
0.18
â̳
0.18
"↵
0.17
"↵↵
0.16
","
0.14
ä
0.14
AndWait
0.14
":"
0.14
":""
0.14
Activations Density 0.299%