INDEX
    Explanations

    phrases related to political figures and events

    mentions of the name "Trump" in various contexts

    Negative Logits
     warp         -0.72
     Droid        -0.72
     Guinness     -0.72
     Bulgarian    -0.69
     Nordic       -0.65
     variance     -0.65
     Voyager      -0.65
     cyan         -0.64
     decomp       -0.64
     cloak        -0.63
    Positive Logits
    ¬              1.16
    £              1.12
    ı              1.11
    Į              1.11
    ¹              1.09
    ¦              1.03
    ª              1.02
    Ī              1.00
    į              1.00
    Ĵ              0.98
    Activation Density: 0.376%

    No Known Activations