    Explanations

    negative sentiments or expressions of doubt and denial

    negative 'n't contractions

    Negative Logits
     nahilalakip        -0.65
    rénées              -0.65
     autorytatywna      -0.63
     kasarigan          -0.62
     Lösungen           -0.60
     Autorisations      -0.60
    gnore               -0.59
    征詢我              -0.59
    oa̍t                 -0.58
     surla              -0.57
    Positive Logits
     my                  0.31
    []{"                 0.27
     guys                0.26
     Phen                0.26
     crazy               0.26
    ±                    0.26
    fVar                 0.25
     (@                  0.25
     bit                 0.25
     Hey                 0.25
    Act Density 0.085%

    No Known Activations