INDEX
Explanations
instances of discourse markers or conversational cues
New Auto-Interp
Negative Logits
esson
-0.15
ogo
-0.15
interest
-0.15
ishi
-0.15
cean
-0.15
wers
-0.15
ؤ
-0.15
rella
-0.14
asu
-0.14
chein
-0.14
POSITIVE LOGITS
USTOM
0.14
gu
0.14
dair
0.14
XObject
0.14
currentColor
0.13
erce
0.13
InputGroup
0.13
ers
0.13
IES
0.13
tol
0.13
Activations Density 0.004%