INDEX
    Explanations

    mentions of political figures, particularly former presidents and vice presidents

    references to political figures and their titles

    New Auto-Interp
    Negative Logits
    Sensor
    -0.68
    atum
    -0.64
    eta
    -0.63
     radius
    -0.63
    âĹ¼
    -0.63
    fw
    -0.62
    endi
    -0.62
    Tree
    -0.62
    Limited
    -0.60
     issu
    -0.60
    POSITIVE LOGITS
     Yugoslavia
    0.89
     Saddam
    0.88
     Colin
    0.79
     Lyndon
    0.79
    turned
    0.78
     Watergate
    0.77
     Newt
    0.77
     Abel
    0.77
     disgr
    0.75
     Yugoslav
    0.75
    Act Density 0.175%

    No Known Activations