INDEX
Explanations
words related to refusal or defiance
instances of refusal or rejection
New Auto-Interp
Negative Logits
soDeliveryDate
-1.04
lined
-0.80
quickShipAvailable
-0.77
soType
-0.75
////////////////////////////////
-0.75
nin
-0.74
largeDownload
-0.71
oward
-0.68
gradient
-0.68
retty
-0.67
POSITIVE LOGITS
accept
1.14
cooperate
1.14
obey
1.10
heed
1.09
acknowledge
1.09
bud
1.06
comply
1.05
endorse
1.02
reproduce
1.01
recognize
1.01
Activations Density 0.048%