INDEX
    Explanations

    references to presidential candidates and political figures

    New Auto-Interp
    Negative Logits
    isme
    -0.16
    ilm
    -0.15
    orum
    -0.15
    usat
    -0.14
    acky
    -0.14
    wers
    -0.14
    ersions
    -0.14
    ieux
    -0.14
    wert
    -0.14
     Tribe
    -0.14
    POSITIVE LOGITS
    abella
    0.16
    orage
    0.15
    ÅĤaw
    0.14
    ECT
    0.14
     LETTER
    0.14
    ëĭĺ
    0.14
    Äįer
    0.14
    -Sep
    0.14
    RequestId
    0.13
     treatments
    0.13
    Act Density 0.077%

    No Known Activations