INDEX
    Explanations

    mentions of individuals or groups of people in various contexts

    New Auto-Interp
    Negative Logits
    clude
    -0.14
     ning
    -0.14
    öz
    -0.14
     something
    -0.14
    isode
    -0.14
    åħ¶ä¸Ń
    -0.13
    agus
    -0.13
    makt
    -0.13
    ssf
    -0.13
    êtes
    -0.13
    POSITIVE LOGITS
     who
    0.31
     everywhere
    0.30
    who
    0.25
     worldwide
    0.21
     across
    0.21
     Who
    0.18
     Everywhere
    0.18
    Who
    0.18
     whose
    0.17
    ino
    0.16
    Act Density 0.296%

    No Known Activations