INDEX
    Explanations

    terminology related to evaluations and outcomes

    New Auto-Interp
    Negative Logits
     increa
    -2.10
     affor
    -1.89
     encomp
    -1.89
     scrat
    -1.86
     unden
    -1.83
     desir
    -1.83
     impra
    -1.82
     guarante
    -1.81
     purcha
    -1.81
     inev
    -1.78
    POSITIVE LOGITS
    .
    0.78
     because
    0.74
     despite
    0.74
    ;
    0.74
     وأن
    0.70
     but
    0.70
    ".
    0.68
     while
    0.68
    0.67
    JTable
    0.66
    Act Density 0.914%

    No Known Activations