INDEX
Explanations
phrases related to intentions or desires
repeated expressions of desire or reluctance to do something
New Auto-Interp
Negative Logits
Enhancement
-0.72
issance
-0.71
strength
-0.71
ements
-0.68
Excellent
-0.67
Compass
-0.64
Smooth
-0.64
Kinnikuman
-0.63
Apart
-0.61
Integrity
-0.60
POSITIVE LOGITS
offend
1.23
jeopard
1.13
spoil
1.13
disturb
1.10
interfere
1.09
lose
1.09
waste
1.08
disappoint
1.05
incur
1.05
bother
1.04
Activations Density 0.067%