INDEX
    Explanations

    names or titles of world leaders

    references to political leaders, specifically presidents

    New Auto-Interp
    Negative Logits
    opter
    -0.85
    eros
    -0.82
    aughs
    -0.73
    andem
    -0.73
    asca
    -0.71
    ritch
    -0.70
    ourcing
    -0.66
    Scot
    -0.65
    aylor
    -0.64
    ogene
    -0.64
    POSITIVE LOGITS
     Mahmoud
    1.00
     Bashar
    0.91
     negotiator
    0.89
     Jinping
    0.89
     Viktor
    0.85
     Hassan
    0.82
     Mahm
    0.77
     Putin
    0.76
     Tayyip
    0.76
     bloc
    0.75
    Act Density 0.087%

    No Known Activations