INDEX
Explanations
phrases involving the word "believe"
expressions of belief and skepticism
Negative Logits
yna            -0.71
ack            -0.70
apeake         -0.70
nice           -0.67
amen           -0.63
pain           -0.63
spr            -0.62
nec            -0.61
hens           -0.61
conservancy    -0.60
Positive Logits
ieve            0.93
iever           0.77
fulness         0.76
ievers          0.75
ieving          0.74
rill            0.74
itious          0.72
sincerity       0.70
passionately    0.70
believing       0.69
Activations Density: 0.047%