INDEX
Explanations
references to blame and accountability
associated with assigning responsibility
assigning blame or responsibility
New Auto-Interp
Negative Logits
المناصب
-0.65
myſelf
-0.64
MessageOf
-0.62
mitives
-0.62
Monfieur
-0.61
wiſe
-0.60
thday
-0.60
purpoſe
-0.59
ientôt
-0.59
pleaſure
-0.59
POSITIVE LOGITS
blame
1.48
blaming
1.36
blames
1.29
blamed
1.26
blame
1.24
Blame
1.22
Blame
1.11
accusing
0.92
culpa
0.87
attribution
0.87
Activations Density 0.632%