INDEX
Explanations
phrases indicating restraint or inhibition
phrases indicating restraint or delay
New Auto-Interp
Negative Logits
²¾
-0.84
swick
-0.75
ivia
-0.74
aundering
-0.71
aters
-0.71
ioxide
-0.71
ceans
-0.69
apest
-0.68
ģ«
-0.66
iterranean
-0.66
POSITIVE LOGITS
hold
0.78
ransom
0.78
hold
0.77
tight
0.73
hostage
0.70
reins
0.70
sway
0.65
ledge
0.64
grip
0.64
plun
0.63
Activations Density 0.082%