INDEX
Explanations
actions related to climbing or ascending
New Auto-Interp
Negative Logits
alley
-0.18
downward
-0.17
arme
-0.15
éľĩ
-0.15
ingleton
-0.14
nero
-0.14
IonicPage
-0.14
rina
-0.14
-0.14
HB
-0.14
POSITIVE LOGITS
Mount
0.22
Mt
0.21
stairs
0.21
aboard
0.20
ladder
0.19
onto
0.19
Jacob
0.18
Jacob
0.18
Mount
0.18
Kil
0.18
Activations Density 0.017%