INDEX
Explanations
affirmations of belief or conviction
Negative Logits
Token     Logit
ar        -0.77
lot       -0.77
N         -0.73
y         -0.72
N         -0.70
lots      -0.69
SE        -0.67
us        -0.66
quad      -0.66
          -0.65
Positive Logits
Token     Logit
BELIEVE   1.81
believe   1.77
believe   1.75
believes  1.70
believed  1.67
Believe   1.61
Believe   1.60
belie     1.58
Belief    1.54
Beliefs   1.54
Activations Density 0.077%
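
A density figure like the one above is commonly read as the fraction of token positions on which the feature is active. The page does not state its exact definition, so the following is only a minimal sketch under that assumption; the function name `activation_density`, the zero threshold, and the sample data are all hypothetical and not taken from the source.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means share of tokens with a nonzero activation.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 10,000 token positions, 7 of which activate the feature.
sample = [0.0] * 9993 + [1.2, 0.4, 3.1, 0.9, 2.2, 0.1, 5.0]
density = activation_density(sample)
print(f"{density:.3%}")  # formatted as a percentage, like the dashboard figure
```

Under this reading, a density of 0.077% would mean the feature fires on fewer than one token in a thousand, which is consistent with it responding to a narrow family of "believe" tokens.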