INDEX
Explanations
actions related to moving or transitioning between states
New Auto-Interp
Negative Logits
whereas
-0.18
but
-0.18
ostel
-0.17
oraz
-0.16
bler
-0.15
ï¼ĮèĢĮä¸Ķ
-0.15
PLUS
-0.15
lẫn
-0.15
ä½Ĩ
-0.15
ãģ»
-0.15
POSITIVE LOGITS
and
0.33
ÙĪØª
0.25
vÃł
0.24
and
0.22
и
0.22
and
0.21
AndGet
0.21
à¹ģละ
0.20
ãģ¨
0.20
ÙĪØ¥
0.19
Activations Density 0.611%