INDEX
    Explanations

    names of specific political figures

    mentions of specific individuals, particularly politicians

    Negative Logits
    "ripp"      -0.92
    "fman"      -0.92
    "BOOK"      -0.83
    "chrom"     -0.82
    "hered"     -0.79
    "Seattle"   -0.78
    "naire"     -0.77
    "plex"      -0.77
    "earing"    -0.77
    "bodied"    -0.76
    Positive Logits
    " Mahmoud"  1.22
    " Abbas"    1.08
    " Ahmad"    0.90
    " Mahm"     0.87
    "ollah"     0.86
    " Gh"       0.86
    " Mubarak"  0.84
    " Ahmed"    0.80
    " Meh"      0.79
    " Abdel"    0.79
    Activation Density: 0.011%

    No Known Activations