INDEX
Explanations
phrases containing the word "blamed" followed by a reason or entity
occurrences of the word "blamed."
New Auto-Interp
Negative Logits
ymph
-0.83
edom
-0.76
marked
-0.75
mental
-0.70
ouver
-0.70
birth
-0.70
atri
-0.69
inational
-0.68
ggies
-0.67
arent
-0.66
POSITIVE LOGITS
blamed
0.94
blames
0.87
blame
0.84
scapego
0.79
blaming
0.79
forgiven
0.77
attribut
0.70
ãĤ¼
0.69
culprit
0.67
victim
0.66
Activations Density 0.006%