INDEX
    Explanations

    instances of personal and community references in the context of events or actions

    New Auto-Interp
    Negative Logits
    iol
    -0.16
    tte
    -0.15
    idebar
    -0.15
    опол
    -0.15
     Kis
    -0.15
    duk
    -0.14
    uw
    -0.14
    éŀ
    -0.14
     Aub
    -0.14
     fandom
    -0.14
    POSITIVE LOGITS
    wards
    0.16
    HAV
    0.16
    اطر
    0.16
    ÑĢиÑĩ
    0.15
    ÑĦÑĦ
    0.15
    aminer
    0.14
    बर
    0.14
    otel
    0.14
    emos
    0.14
     ÙħÛĮÙĦادÛĮ
    0.14
    Act Density 0.067%

    No Known Activations