INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    AndEndTag
    -0.57
     AudioClip
    -0.57
     Baillargeon
    -0.56
    ieważ
    -0.55
    RectangleBorder
    -0.54
    visející
    -0.53
     whereas
    -0.53
    '],'
    -0.53
    μών
    -0.51
    fraid
    -0.51
    POSITIVE LOGITS
     turn
    2.38
    turn
    1.97
    Turn
    1.70
     Turn
    1.63
     TURN
    1.59
     turns
    1.51
    TURN
    1.44
     turned
    1.28
    turns
    1.28
     turno
    1.19
    Act Density 0.248%

    No Known Activations