INDEX
    Explanations

    sentences that express negative sentiments or identify problems.

    New Auto-Interp
    Negative Logits
     problem
    -0.84
    problem
    -0.83
     problème
    -0.80
     problems
    -0.79
    Probl
    -0.78
     Problem
    -0.77
     problém
    -0.77
    Problem
    -0.77
    PROBLEM
    -0.76
     problema
    -0.74
    POSITIVE LOGITS
    Autoritní
    0.73
    RTLR
    0.70
    <bos>
    0.66
     Meksiku
    0.56
    erweise
    0.54
    олові
    0.52
    ături
    0.50
     personalizar
    0.50
    pierw
    0.50
     saites
    0.49
    Act Density 36.081%

    No Known Activations