INDEX
Explanations
actions involving falling or crashing
New Auto-Interp
Negative Logits
idge
-0.17
442
-0.15
erton
-0.15
endon
-0.14
_ABI
-0.14
دÙĩ
-0.13
quip
-0.13
tl
-0.13
apo
-0.13
aben
-0.13
POSITIVE LOGITS
onto
0.28
onto
0.23
landing
0.21
Landing
0.20
falling
0.20
èIJ
0.19
heap
0.19
landing
0.19
åľ¨åľ°
0.19
Ont
0.19
Activations Density 0.089%