INDEX
    Explanations

    actions related to moving or transitioning between states

    New Auto-Interp
    Negative Logits
     whereas
    -0.18
     but
    -0.18
    ostel
    -0.17
     oraz
    -0.16
    bler
    -0.15
    ï¼ĮèĢĮä¸Ķ
    -0.15
     PLUS
    -0.15
     lẫn
    -0.15
    ä½Ĩ
    -0.15
    ãģ»
    -0.15
    POSITIVE LOGITS
     and
    0.33
     ÙĪØª
    0.25
     vÃł
    0.24
    	and
    0.22
     и
    0.22
    and
    0.21
    AndGet
    0.21
    à¹ģละ
    0.20
    ãģ¨
    0.20
     ÙĪØ¥
    0.19
    Act Density 0.611%

    No Known Activations