INDEX
Explanations
negatives emphasizing the quality or nature of something being unfavorable
Comes after "not"
New Auto-Interp
Negative Logits
ConstraintMaker
-0.55
NSCoder
-0.55
ivelany
-0.55
duquel
-0.53
absolutely
-0.51
しまいます
-0.50
しまう
-0.49
finally
-0.49
Boring
-0.49
ってしまう
-0.48
POSITIVE LOGITS
pleasant
0.92
good
0.81
conducive
0.72
bueno
0.71
welcome
0.71
desirable
0.71
nice
0.69
ideal
0.68
favorable
0.67
bode
0.66
Activations Density 0.255%