INDEX
Explanations
phrases related to assigning blame or fault
references to assigning blame in various contexts
New Auto-Interp
Negative Logits
chan
-0.78
forms
-0.75
marked
-0.75
intend
-0.74
irl
-0.72
fo
-0.72
mare
-0.70
mens
-0.69
miss
-0.67
mental
-0.65
POSITIVE LOGITS
blame
0.92
blaming
0.85
forgiven
0.77
blames
0.75
squarely
0.65
forgiveness
0.64
awaru
0.62
blamed
0.61
oshop
0.60
fulness
0.60
Activations Density 0.016%