INDEX
Explanations
instances of quotes and dialogue in the text
Negative Logits
Token           Logit
umper           -0.15
ingo            -0.15
Ïģοι            -0.15
'hui            -0.14
defaultMessage  -0.14
yyn             -0.14
ç¿°             -0.14
'na             -0.14
gst             -0.13
ories           -0.13
Positive Logits
Token   Logit
ve      0.39
ll      0.37
ve      0.32
ll      0.29
LL      0.27
re      0.26
VE      0.26
d       0.24
.ll     0.23
_ll     0.23
Activations Density 0.016%