INDEX
    Explanations

    instances of the abbreviation "Gr" followed by a number, which likely indicates grades or scores

    New Auto-Interp
    Negative Logits
    an
    -0.32
    at
    -0.19
    iance
    -0.17
    anio
    -0.17
    anca
    -0.15
    a
    -0.15
    anou
    -0.15
    straint
    -0.14
    corr
    -0.14
    anlar
    -0.14
    POSITIVE LOGITS
    imes
    0.21
    uber
    0.20
    instead
    0.19
    ims
    0.18
    indle
    0.18
    imal
    0.18
     Gr
    0.18
    uner
    0.17
    ÑĢеб
    0.17
    illo
    0.17
    Act Density 0.008%

    No Known Activations