INDEX
    Explanations

    the name "Donald Trump" in various contexts

    New Auto-Interp
    Negative Logits
    pole
    -0.81
     dots
    -0.77
     foss
    -0.68
    erb
    -0.67
    duct
    -0.65
    esville
    -0.64
     notebooks
    -0.64
    shapeshifter
    -0.63
    papers
    -0.63
    kers
    -0.62
    POSITIVE LOGITS
     Jinping
    0.76
    terness
    0.75
     Bid
    0.73
     Abrams
    0.70
    thora
    0.68
     Marshall
    0.67
     Doyle
    0.66
    °
    0.66
    ª
    0.65
     Griffin
    0.64
    Act Density 0.116%

    No Known Activations