INDEX
    Explanations

    references to former political figures and officials

    New Auto-Interp
    Negative Logits
    former
    -0.20
     former
    -0.18
    Former
    -0.18
     formerly
    -0.17
     Former
    -0.17
    缮åīį
    -0.16
    /he
    -0.16
    yy
    -0.15
     older
    -0.15
    uck
    -0.15
    POSITIVE LOGITS
    /current
    0.34
    /original
    0.19
    odus
    0.19
     Yugoslavia
    0.18
    /new
    0.18
    ly
    0.17
     employees
    0.16
    asper
    0.16
    LY
    0.16
    ucha
    0.16
    Act Density 0.048%

    No Known Activations