INDEX
    Explanations

    indicators of successful outcomes or positive trends

    Negative Logits
    Билгалдахарш        -0.61
     otomatig           -0.60
     Administrativna    -0.56
    sidemargin          -0.51
    Autoritní           -0.50
    "}")                -0.49
    TagMode             -0.47
     ComVisible         -0.47
    +#+#                -0.46
    сылкі               -0.46

    Positive Logits
     praising            0.49
    Praise               0.47
    praise               0.46
     Praise              0.45
     praise              0.45
     praised             0.45
    Positive             0.44
    proud                0.41
     обна                0.40
     praises             0.39
    Activation Density 0.043%

    No Known Activations