INDEX
Explanations
verbs associated with refusal or denial
instances of the contraction "won't"
New Auto-Interp
Negative Logits
Kings
-0.68
culated
-0.68
itiz
-0.64
illustration
-0.63
soType
-0.62
Floating
-0.59
FontSize
-0.58
Anat
-0.58
Metallic
-0.58
contrasting
-0.58
POSITIVE LOGITS
necessarily
0.98
ember
0.84
rue
0.82
entimes
0.78
bud
0.78
erest
0.77
payers
0.77
acters
0.75
angular
0.75
ardless
0.74
Activations Density 0.036%