INDEX
Explanations
actions related to movement and physical interaction
Head Attr Weights
0:0.07
1:0.02
2:0.07
3:0.04
4:0.04
5:0.10
6:0.02
7:0.02
8:0.40
9:0.03
10:0.09
11:0.06
Negative Logits
sche       -1.51
PV         -1.49
reviewer   -1.47
umar       -1.46
BJ         -1.39
additive   -1.35
olia       -1.35
ym         -1.34
olin       -1.33
Adds       -1.33
Positive Logits
resp       1.69
Emer       1.60
iesta      1.59
istg       1.53
ラ         1.52
gust       1.50
mes        1.50
BUS        1.49
TAMADRA    1.49
ESS        1.44
Activations Density: 0.469%