INDEX
Explanations
phrases related to acquiring or receiving something
NEGATIVE LOGITS
olf        -0.16
bottoms    -0.14
зÑĥ        -0.14
Stef       -0.14
nobody     -0.14
eno        -0.14
opaque     -0.13
un         -0.13
461        -0.13
ugh        -0.13
POSITIVE LOGITS
oire       0.18
chas       0.16
estar      0.16
ENTE       0.15
볨         0.15
tlement    0.15
Darling    0.15
ekt        0.15
ascus      0.14
loyd       0.14
Activations Density 0.132%
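
For orientation, a density figure like the one above is commonly read as the fraction of sampled token positions on which the feature fires. The page does not state how its value was computed, so the following is only a minimal sketch under that assumption. The function name `activation_density`, the zero threshold, and the sample data are all hypothetical and not taken from this page.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of token positions on which the
    feature is active. The threshold of 0.0 is an illustrative choice.
    """
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical data: 10,000 token positions, 13 of which activate.
sample = [0.0] * 9987 + [1.5] * 13
density = activation_density(sample)
print(f"{density:.3%}")  # formatted as a percentage with three decimals
```

Under this reading, a density of 0.132% would mean the feature is active on roughly 13 of every 10,000 sampled tokens.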