Explanations
words related to feelings of guilt and remorse
references to feelings of guilt and remorse
Negative Logits
Token      Logit
Amph       -0.77
emer       -0.75
lar        -0.69
idian      -0.69
Systems    -0.68
CM         -0.67
Table      -0.64
ature      -0.63
ks         -0.62
marine     -0.62
Positive Logits
Token        Logit
guilt        3.93
remorse      1.80
innocence    1.62
guilty       1.57
culp         1.49
shame        1.47
blame        1.43
Guilty       1.33
conscience   1.25
resentment   1.23
Activations Density 0.016%