INDEX
    Explanations

    words and phrases expressing negativity, failure, or wrongdoing on a large scale

    New Auto-Interp
    Negative Logits
     оригіналу
    -0.66
    تقاوى
    -0.65
     AssemblyCulture
    -0.64
     peor
    -0.58
     peores
    -0.57
     فريبيس
    -0.57
    oredCriteria
    -0.56
     mauvaise
    -0.56
    tagHelperRunner
    -0.54
    Worse
    -0.54
    POSITIVE LOGITS
     незавершена
    0.59
    0.56
     autorytatywna
    0.56
    ratulations
    0.55
    HasAnnotation
    0.52
    getragen
    0.51
     actionMode
    0.50
    cheme
    0.49
    点了点头
    0.48
    ulage
    0.47
    Act Density 4.029%

    No Known Activations