INDEX
Explanations
actions and movements described in dynamic or physical terms
New Auto-Interp
Negative Logits
undy
-0.15
itre
-0.15
igo
-0.15
ãĥĥãĥĦ
-0.15
trace
-0.14
ä¹ĭä¸Ģ
-0.14
stasy
-0.14
.rs
-0.13
Russo
-0.13
opus
-0.13
POSITIVE LOGITS
into
0.24
forth
0.23
away
0.23
into
0.18
away
0.16
past
0.16
off
0.16
Away
0.16
toward
0.16
ingly
0.16
Activations Density 0.090%