INDEX
    Explanations

    actions related to flipping or rolling over

    New Auto-Interp
    Negative Logits
    новниш
    -0.47
     poveznice
    -0.46
    Đi
    -0.45
     Popis
    -0.45
    どうしても
    -0.45
     plads
    -0.44
     oublié
    -0.44
     maaf
    -0.44
    ்கு
    -0.43
     Po
    -0.43
    POSITIVE LOGITS
     flipped
    1.41
     flips
    1.41
     flipping
    1.39
     flip
    1.36
    flip
    1.26
    Flip
    1.24
     Flip
    1.19
     overturn
    1.15
    turning
    1.14
     Turning
    1.13
    Act Density 0.217%

    No Known Activations