INDEX
Explanations
instances of shaking or nodding, particularly in relation to emotional responses
New Auto-Interp
Negative Logits
aná
-0.18
hone
-0.16
اÙĦدر
-0.15
unnable
-0.15
åĭĻ
-0.15
acting
-0.15
Backing
-0.15
agini
-0.14
ÃĹ↵↵
-0.14
awks
-0.14
POSITIVE LOGITS
ingly
0.26
ly
0.16
avian
0.15
øj
0.15
-fast
0.15
-proof
0.15
ring
0.15
oord
0.15
kel
0.14
htar
0.14
Activations Density 0.049%