INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    awtextra
    -0.73
     שוליים
    -0.67
     disambiguazione
    -0.65
    seamnă
    -0.64
    mità
    -0.63
    bellar
    -0.63
     Efq
    -0.62
    Външни
    -0.60
     Hima
    -0.59
     indietro
    -0.57
    POSITIVE LOGITS
     turn
    0.65
    turn
    0.56
     unmute
    0.55
     toughest
    0.52
     turning
    0.51
     Turn
    0.51
    Turn
    0.50
    cob
    0.50
     turns
    0.48
     TURN
    0.48
    Act Density 0.086%

    No Known Activations