INDEX
Explanations
related to the concept of rejection or refusal
terms related to the act of rejecting or refusal
New Auto-Interp
Negative Logits
brance
-0.77
ussen
-0.75
vern
-0.73
inki
-0.69
llular
-0.69
andise
-0.69
ixtape
-0.68
breaker
-0.68
isd
-0.67
dro
-0.65
POSITIVE LOGITS
outright
0.84
hypotheses
0.79
suggestions
0.78
hap
0.77
excuses
0.76
REF
0.74
pleas
0.74
attempts
0.72
responsibility
0.69
propositions
0.69
Activations Density 0.052%