Explanations
affirmations and expressions of agreement
Negative Logits
oplay     -0.16
idot      -0.15
rchive    -0.15
.bz       -0.15
QRS       -0.15
eldorf    -0.14
945       -0.13
aña       -0.13
iders     -0.13
tec       -0.13
Positive Logits
yes       0.45
correct   0.41
yes       0.40
Yes       0.36
right     0.34
yup       0.34
Yep       0.33
Yup       0.31
Yep       0.31
Yes       0.30
Activations Density: 0.072%