INDEX
    Explanations

    references to specific individuals named Sher or related terms

    Negative Logits
    hra      -0.17
    usc      -0.16
    INU      -0.15
    cale     -0.14
    avar     -0.14
     ç¼      -0.14
    599      -0.13
    ακ       -0.13
    bjerg    -0.13
    ASN      -0.13

    Positive Logits
    ects     0.19
    iffs     0.17
    ect      0.17
    ldr      0.16
    inkle    0.16
    esz      0.16
    pherd    0.15
    Ñĥки     0.15
    don      0.15
    emet     0.14

    Act Density 0.010%

    No Known Activations