INDEX
Explanations
instances of the word "guilt" and related sentiments
references to feelings of guilt
Negative Logits
chan          -0.72
IFE           -0.70
andel         -0.69
psey          -0.69
urgical       -0.68
IPS           -0.68
989           -0.66
Occupations   -0.63
ANN           -0.63
RFC           -0.61
Positive Logits
guilt         0.94
lessness      0.88
less          0.86
conscience    0.80
ibal          0.79
iness         0.79
worthiness    0.79
fulness       0.79
innocence     0.76
fully         0.75
Activations Density: 0.031%