INDEX
    Explanations

    political figures or media outlets being criticized

    instances of the word "criticized" and its variations

    Negative Logits
    Token      Logit
    "OTE"      -0.73
    "ëĭ"       -0.71
    "bourne"   -0.69
    "mop"      -0.68
    "ther"     -0.67
    "ulhu"     -0.66
    "aho"      -0.66
    "ammy"     -0.65
    "nown"     -0.65
    "mad"      -0.64
    Positive Logits
    Token            Logit
    "imaru"          0.73
    " Cosponsors"    0.73
    " harshly"       0.69
    " comments"      0.63
    " remarks"       0.63
    " Stab"          0.62
    " him"           0.61
    " Orb"           0.61
    " critiques"     0.61
    " critic"        0.61
    Activation Density: 0.050%

    No Known Activations