INDEX
Explanations
actions and movements occurring in various contexts
NEGATIVE LOGITS
upside      -0.19
outright    -0.14
cki         -0.14
541         -0.14
flatt       -0.13
icina       -0.13
aris        -0.13
спіль       -0.13
near        -0.13
uri         -0.13
POSITIVE LOGITS
forward     0.47
Forward     0.36
onto        0.35
past        0.34
FORWARD     0.34
into        0.33
forward     0.32
_forward    0.32
onto        0.31
Forward     0.31
Activations Density: 0.727%