INDEX
    Explanations

    references to individuals with notable achievements

    Negative Logits
     lesbische    -0.15
    meer          -0.15
     tavs         -0.14
    rech          -0.13
    uns           -0.13
    /cpp          -0.13
     Pra          -0.13
    زل            -0.13
     kvin         -0.13
    lain          -0.13
    Positive Logits
    ldre          0.18
     till         0.17
    пример        0.17
    emy           0.16
     Sund         0.16
    nad           0.15
    报            0.15
    å             0.15
    orna          0.15
     tack         0.15
    Activation Density 0.143%

    No Known Activations