INDEX
    Explanations

    references to audience and societal groups

    New Auto-Interp
    Negative Logits
    vek
    -0.07
    arto
    -0.07
    660
    -0.07
     unto
    -0.07
     ÑģобÑĸ
    -0.06
    lopen
    -0.06
    ici
    -0.06
    İ
    -0.06
    åΰ
    -0.06
    ICI
    -0.06
    POSITIVE LOGITS
     about
    0.14
    about
    0.12
     tentang
    0.11
     دربارÙĩ
    0.10
    _about
    0.10
     regarding
    0.10
    åħ³äºİ
    0.10
     concerning
    0.09
    -about
    0.09
     About
    0.09
    Act Density 0.060%

    No Known Activations