INDEX
    Explanations

    mentions of political leaders or significant government figures

    New Auto-Interp
    Negative Logits
     fta
    -1.42
     effe
    -1.37
     ftu
    -1.35
     thut
    -1.35
     aen
    -1.34
     fep
    -1.31
     fatis
    -1.30
     mef
    -1.29
     secon
    -1.29
     fte
    -1.28
    POSITIVE LOGITS
     himself
    1.09
    '
    0.78
    0.77
    himself
    0.74
     herself
    0.74
     Himself
    0.71
    ׳
    0.70
    cellor
    0.69
     president
    0.67
     who
    0.66
    Act Density 0.288%

    No Known Activations