INDEX
    Explanations

    political party affiliations, particularly focusing on Democrats and Republicans

    New Auto-Interp
    Negative Logits
    elan
    -0.15
    تا
    -0.14
     Horn
    -0.14
    иком
    -0.14
    ÑĢоÑĩ
    -0.14
    rong
    -0.14
    ĸī
    -0.13
     Parser
    -0.13
     Mercer
    -0.13
    lj
    -0.13
    POSITIVE LOGITS
    atatype
    0.17
    oola
    0.16
    ennes
    0.16
    kili
    0.15
    NullException
    0.15
    enant
    0.15
     Mig
    0.15
    ipline
    0.14
    antine
    0.14
    rollback
    0.14
    Act Density 0.009%

    No Known Activations