    Explanations

    mentions of medals and awards in sports contexts

    Negative Logits
    Token          Logit
    "airo"         -0.18
    "itesse"       -0.14
    "ilty"         -0.14
    "gebung"       -0.14
    "俊"           -0.14
    " Garland"     -0.14
    "ORMAL"        -0.14
    "taş"          -0.14
    "alarından"    -0.14
    "orris"        -0.13
    Positive Logits
    Token          Logit
    " Wolfe"       0.15
    "yz"           0.15
    "iless"        0.14
    "anan"         0.14
    "ibri"         0.14
    "ocl"          0.14
    " sung"        0.14
    "ersiz"        0.14
    "ustral"       0.14
    "incible"      0.13
    Activation Density 0.021%

    No Known Activations