INDEX
    Explanations

    names or abbreviation initials followed by a particular letter grade

    proper nouns, specifically names and titles

    New Auto-Interp
    Negative Logits
    umbn
    -0.69
    saf
    -0.68
    éĹĺ
    -0.66
    ModLoader
    -0.65
    oided
    -0.64
    caps
    -0.63
    thirds
    -0.63
    Fair
    -0.62
    availability
    -0.60
    Reviewer
    -0.59
    POSITIVE LOGITS
    .?
    0.89
    .,
    0.82
    .:
    0.73
    ./
    0.72
    .;
    0.71
    ullivan
    0.67
     Armour
    0.64
    .,"
    0.63
    uce
    0.63
    #$
    0.63
    Act Density 0.064%

    No Known Activations