INDEX
    Explanations

    references to Sarah Palin

    New Auto-Interp
    Negative Logits
    URI
    -0.72
    ires
    -0.65
    Ö¼
    -0.63
    reek
    -0.62
    Increases
    -0.62
    ~~~~~~~~~~~~~~~~
    -0.62
    Oracle
    -0.61
    omething
    -0.61
    EAR
    -0.60
    REE
    -0.60
    POSITIVE LOGITS
     Palin
    1.12
    istani
    0.86
    igon
    0.85
    tera
    0.80
    EStream
    0.80
    bernatorial
    0.79
    aska
    0.77
    steen
    0.77
    ohydrate
    0.75
     Schwarzenegger
    0.74
    Act Density 0.008%

    No Known Activations