INDEX
Explanations
instances where someone refuses to do something
instances of the word "refused"
New Auto-Interp
Negative Logits
Clean
-0.82
groups
-0.81
psc
-0.76
Day
-0.76
Assembly
-0.75
è¦ļéĨĴ
-0.75
mental
-0.74
GROUP
-0.73
day
-0.71
arya
-0.70
POSITIVE LOGITS
refused
0.93
refuse
0.93
ĸļ
0.88
refuses
0.87
vehemently
0.84
refusal
0.80
refusing
0.80
miser
0.79
repeatedly
0.77
unanimously
0.74
Activations Density 0.012%