INDEX
Explanations
instances of the word "guilt" or related phrases indicating a feeling of guilt
references to guilt and remorse
New Auto-Interp
Negative Logits
psey
-0.71
Occupations
-0.69
andel
-0.68
chan
-0.67
IPS
-0.64
openings
-0.62
livest
-0.62
66666666
-0.62
IFE
-0.62
CFR
-0.62
POSITIVE LOGITS
lessness
0.93
guilt
0.91
less
0.87
fulness
0.86
fully
0.84
worthiness
0.81
lessly
0.76
iness
0.75
conscience
0.74
faced
0.74
Activations Density 0.023%