INDEX
    Explanations

    proper nouns and individuals mentioned in political contexts

    New Auto-Interp
    Negative Logits
    æ°¸ä¹ħ
    -0.07
    /fw
    -0.06
    oog
    -0.06
    ãĤĬãģ¨
    -0.06
    lobals
    -0.06
     Orn
    -0.06
    lette
    -0.06
    à¤łà¤¨
    -0.06
     fractional
    -0.06
    áºŃp
    -0.06
    POSITIVE LOGITS
    ouve
    0.07
    anton
    0.07
    üns
    0.06
    kers
    0.06
    aket
    0.06
    anka
    0.06
     Consort
    0.06
    ouz
    0.06
    лаж
    0.06
    coli
    0.06
    Act Density 0.017%

    No Known Activations