INDEX
    Explanations

    uppercase letters or acronyms in the document

    New Auto-Interp
    Negative Logits
     ILCS
    -0.72
    -+-+
    -0.69
     stru
    -0.66
    ãĤ©
    -0.66
     unsupported
    -0.65
     Sweeney
    -0.65
     Bere
    -0.64
    à¤
    -0.63
    \/\/
    -0.63
     TABLE
    -0.63
    POSITIVE LOGITS
    tg
    0.83
    yu
    0.83
    tarian
    0.79
    Fi
    0.79
    idian
    0.76
    vP
    0.75
    ymes
    0.74
    dL
    0.74
    yne
    0.72
    Ha
    0.72
    Act Density 0.064%

    No Known Activations