Explanations
words and phrases related to actions and performances
Negative Logits
someone     -0.22
someone     -0.21
somebody    -0.20
一个人      -0.20
sebuah      -0.20
Someone     -0.19
alguien     -0.18
si          -0.17
Someone     -0.17
htar        -0.15
Positive Logits
quite       0.23
such        0.21
Quite       0.20
quite       0.18
SUCH        0.18
amore       0.17
somewhat    0.15
pend        0.14
arg         0.14
[           0.14
Activations Density 0.300%