INDEX
    Explanations

    negative expressions or phrases, particularly those related to situations of loss or disappointment

    New Auto-Interp
    Negative Logits
     Efq
    -1.09
    aarrggbb
    -1.00
     nahilalakip
    -0.96
     Winaray
    -0.85
     Italijani
    -0.85
     Majefty
    -0.82
     auffi
    -0.80
    SourceChecksum
    -0.75
     joaat
    -0.74
     Monfieur
    -0.73
    POSITIVE LOGITS
     Out
    0.72
     out
    0.69
     OUT
    0.68
    Out
    0.62
    out
    0.54
     estekak
    0.53
    OUT
    0.52
     Auss
    0.48
      
    0.45
    utate
    0.44
    Act Density 0.095%

    No Known Activations