INDEX
    Explanations

    names of specific individuals

    mentions of specific individuals, particularly those related to political contexts

    New Auto-Interp
    Negative Logits
    anim
    -0.86
     Animation
    -0.84
    cius
    -0.79
    gaard
    -0.76
     Anim
    -0.75
    Japan
    -0.72
    Ky
    -0.70
     animation
    -0.69
    Åį
    -0.68
     Guards
    -0.68
    POSITIVE LOGITS
     Donna
    2.30
     Debbie
    1.75
     Podesta
    1.74
     Braz
    1.64
     DNC
    1.56
     Wasserman
    1.54
    к
    1.43
     Chevy
    1.33
     Imran
    1.26
     Betty
    1.23
    Act Density 0.034%

    No Known Activations