INDEX
    Explanations

    terms and phrases related to numerical values and scoring

    New Auto-Interp
    Negative Logits
    517
    -0.14
    itters
    -0.14
    ced
    -0.14
    amina
    -0.14
     COPYING
    -0.14
    ubu
    -0.14
     courtesy
    -0.14
    æµ®
    -0.14
     GOT
    -0.14
    birth
    -0.13
    POSITIVE LOGITS
    yne
    0.17
    errat
    0.16
     Freeman
    0.14
    èĨ
    0.14
    erval
    0.14
     Convention
    0.14
    ORTH
    0.14
    886
    0.13
    orth
    0.13
    rok
    0.13
    Act Density 0.007%

    No Known Activations