INDEX
    Explanations

    references to candidates in political contexts

    New Auto-Interp
    Negative Logits
    ington
    -0.16
    oya
    -0.16
    baÅŁ
    -0.16
    undi
    -0.15
    rahim
    -0.15
    azor
    -0.15
    æł·çļĦ
    -0.15
    canf
    -0.15
    thing
    -0.15
    coming
    -0.15
    POSITIVE LOGITS
     who
    0.23
     whom
    0.22
    who
    0.21
    hip
    0.18
    hood
    0.17
    êµ°
    0.17
    ion
    0.17
    们
    0.16
    /app
    0.16
     pool
    0.16
    Act Density 0.050%

    No Known Activations