INDEX
Explanations
phrases containing instructions or warnings
phrases that convey warnings or advice against certain actions
Negative Logits
Token               Logit
ウス                -0.81
quickShipAvailable  -0.80
proving             -0.68
Reborn              -0.68
albeit              -0.68
assurances          -0.66
Crusade             -0.65
Preservation        -0.63
assures             -0.63
omen                -0.62
Positive Logits
Token               Logit
bother              1.23
underestimate       1.15
interfere           1.13
hesitate            1.08
worry               1.07
confuse             1.05
medd                1.03
disturb             1.00
discriminate        0.99
stray               0.99
Activations Density: 0.174%