    Explanations

    references to George W. Bush

    Negative Logits
    Token         Logit
    " Shreve"     -0.82
    " nguyễn"     -0.78
    "ransition"   -0.77
    "atedral"     -0.75
    "kler"        -0.74
    " McKinley"   -0.74
    "avits"       -0.73
    "ligators"    -0.72
    " Chall"      -0.72
    "Jakub"       -0.71

    Positive Logits
    Token         Logit
    " George"     2.21
    "George"      1.99
    "george"      1.80
    " GEORGE"     1.76
    " george"     1.70
    "GEORGE"      1.59
    " Georges"    1.58
    "Georges"     1.29
    " Geo"        1.20
    " Georgie"    1.18
    Activation Density: 0.016%

    No Known Activations