INDEX
    Explanations

    mentions of politicians, specifically focusing on the name patterns similar to "Biden" and "Clinton"

    names related to political figures or identities

    New Auto-Interp
    Negative Logits
     exha
    -0.89
    vati
    -0.77
     rul
    -0.75
    pmwiki
    -0.75
     weeds
    -0.74
    «ĺ
    -0.73
     murd
    -0.73
    thora
    -0.72
     ILCS
    -0.72
    ãħĭ
    -0.71
    POSITIVE LOGITS
    iden
    1.23
    ovo
    0.97
    vier
    0.92
    ners
    0.91
    ception
    0.89
    unci
    0.88
    heimer
    0.85
    fold
    0.84
    vironment
    0.83
    ning
    0.82
    Act Density 0.013%

    No Known Activations