Explanations
connections between events or situations and the act of assigning blame or responsibility
phrases that denote blame or attribution
Negative Logits
ecast        -0.75
alde         -0.71
adia         -0.71
ibaba        -0.68
DH           -0.67
workplaces   -0.65
MW           -0.65
floors       -0.65
cludes       -0.65
rentices     -0.62
Positive Logits
blame        0.80
à¨           0.76
Ö¼           0.69
culprit      0.68
Allaah       0.68
bluff        0.68
ãĤ§          0.67
acronym      0.67
è¦ļéĨĴ       0.66
̶            0.64
Activations Density 0.133%
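
Several tokens in the logit lists above (for example è¦ļéĨĴ and ãĤ§) look garbled but are likely raw vocabulary strings rather than encoding damage. The sketch below assumes the dashboard's model uses a GPT-2-style byte-level BPE tokenizer, which the page itself does not state; under that assumption each character stands for one byte, and mapping the characters back to bytes recovers the readable text. The function names `bytes_to_unicode` and `decode_token` are illustrative, not part of any dashboard API.

```python
def bytes_to_unicode() -> dict:
    """Byte -> printable character table, following the GPT-2 byte-level BPE scheme.

    Assumption: the model behind this dashboard uses that scheme.
    """
    # Bytes that are already printable map to themselves.
    printable = (
        list(range(0x21, 0x7F))     # '!' .. '~'
        + list(range(0xA1, 0xAD))   # U+00A1 .. U+00AC
        + list(range(0xAE, 0x100))  # U+00AE .. U+00FF
    )
    table = {b: chr(b) for b in printable}
    # Every remaining byte is shifted to a code point at 256 or above.
    shift = 0
    for b in range(256):
        if b not in table:
            table[b] = chr(256 + shift)
            shift += 1
    return table


UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token: str) -> str:
    """Turn a byte-level vocabulary string back into readable text.

    Tokens containing characters outside the table are returned unchanged,
    since they are presumably already decoded. Incomplete UTF-8 sequences
    (a token that is only part of a character) decode to U+FFFD.
    """
    if any(ch not in UNICODE_TO_BYTE for ch in token):
        return token
    raw = bytes(UNICODE_TO_BYTE[ch] for ch in token)
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for tok in ["blame", "è¦ļéĨĴ", "ãĤ§", "Ö¼"]:
        print(repr(tok), "->", repr(decode_token(tok)))
```

If the assumption holds, è¦ļéĨĴ corresponds to the bytes E8 A6 9A E9 86 92, which is the Japanese word 覚醒, and ãĤ§ to the katakana ェ. A token such as à¨ is only the first two bytes of a three-byte character, so it has no standalone readable form.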