INDEX
Explanations
direct speech or quotes in the text
New Auto-Interp
Negative Logits
ScreenState
-0.16
thon
-0.14
apan
-0.14
ibbean
-0.14
avel
-0.14
odyn
-0.14
arian
-0.14
licht
-0.14
odo
-0.14
lias
-0.14
POSITIVE LOGITS
defs
0.15
Gord
0.14
stup
0.13
Managed
0.13
XT
0.13
entai
0.13
anzi
0.13
даÑĤ
0.12
837
0.12
@[
0.12
Activations Density 0.034%