INDEX
    Explanations

    words indicating refusal or negation in decision-making contexts

    Negative Logits
     pinulongan     -0.60
    GenerationType  -0.57
    iecie           -0.56
     zd             -0.53
    soort           -0.53
     Drapeau        -0.52
    τρο             -0.51
    leep            -0.51
     focal          -0.50
     Pin            -0.50
    Positive Logits
     melakukannya   0.76
    devamını        0.65
     fernández      0.61
     aikaa          0.60
    这样做          0.59
     partecipare    0.59
     solches        0.58
     tričko         0.58
     eccell         0.58
     concernés      0.58
    Activation Density: 0.421%

    No Known Activations