INDEX
    Explanations

    references to scoring or ratings in various contexts

    New Auto-Interp
    Negative Logits
     onCancelled
    -0.74
    AlterField
    -0.72
     препратки
    -0.72
    =""/>
    -0.70
     herum
    -0.69
    isnan
    -0.69
    zewod
    -0.67
    Gweler
    -0.67
     hinein
    -0.67
    Hyde
    -0.66
    POSITIVE LOGITS
     scores
    1.88
     score
    1.79
     Scores
    1.74
     scored
    1.68
     Score
    1.59
    Scores
    1.56
    scores
    1.54
     scoring
    1.54
    score
    1.50
    Score
    1.47
    Act Density 0.039%

    No Known Activations