INDEX
Explanations
phrases related to significant changes or actions
New Auto-Interp
Negative Logits
irré
-0.74
Jumping
-0.65
jumping
-0.64
Falling
-0.61
constamment
-0.60
touching
-0.60
Datuak
-0.60
jumping
-0.57
personlig
-0.57
bailando
-0.56
POSITIVE LOGITS
walk
1.09
rise
1.07
hike
1.04
roll
1.02
move
0.99
climb
0.98
turn
0.94
drop
0.92
visit
0.90
push
0.90
Activations Density 0.522%