INDEX
Explanations
phrases indicating urgency and the necessity for action or decision-making
New Auto-Interp
Negative Logits
till
-0.16
umont
-0.14
enie
-0.14
mate
-0.14
an
-0.13
Pant
-0.13
ep
-0.13
trous
-0.13
columnist
-0.13
bitch
-0.13
POSITIVE LOGITS
otherwise
0.26
OTHERWISE
0.23
Otherwise
0.23
Otherwise
0.23
otherwise
0.19
åIJ¦
0.19
tir
0.16
udas
0.16
Cres
0.16
loquent
0.15
Activations Density 0.177%