INDEX
Explanations
words that express blame or responsibility
instances of the word "blame" and its related forms
Negative Logits
intend      -0.81
quire       -0.72
tein        -0.72
chedel      -0.69
weeney      -0.68
cephal      -0.68
ierre       -0.67
mental      -0.66
naissance   -0.66
ser         -0.65
Positive Logits
blame       1.15
scapego     0.95
blames      0.90
blaming     0.90
squarely    0.86
blamed      0.81
victim      0.80
culprit     0.79
obsc        0.72
victims     0.71
Activations Density: 0.046%
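
To give a sense of scale for the activation density reported above, the short sketch below converts the percentage into an expected count of activating tokens. It assumes that "density" means the fraction of tokens on which the feature has a nonzero activation; the page does not define the term, so treat this reading as an assumption. The function name `expected_activations` and the corpus size of one million tokens are hypothetical and chosen only for illustration.

```python
def expected_activations(density_percent: float, n_tokens: int) -> float:
    """Expected number of tokens with a nonzero activation.

    Assumes density_percent is the percentage of tokens on which the
    feature fires (an interpretation, not stated on the page).
    """
    return density_percent / 100.0 * n_tokens


# 0.046 is the density value listed above; the corpus size is illustrative.
density_percent = 0.046
n_tokens = 1_000_000

count = expected_activations(density_percent, n_tokens)
print(f"about {count:.0f} activating tokens per {n_tokens:,} tokens")
```

Under that reading the feature is sparse: it fires on roughly one token in every two thousand, which fits a feature tied to a narrow family of words such as "blame" and its related forms.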