INDEX
    Explanations

    references to historical figures and their political significance

    New Auto-Interp
    Negative Logits
    lsen
    -0.15
    iple
    -0.15
    ιβ
    -0.14
    inaire
    -0.14
     åij
    -0.14
    æĦŁæĥħ
    -0.14
    apo
    -0.14
    Ùĥار
    -0.14
    paged
    -0.13
    골
    -0.13
    POSITIVE LOGITS
     office
    0.19
     term
    0.18
    office
    0.17
    ứng
    0.15
    avit
    0.14
     runApp
    0.14
     Office
    0.14
    nsic
    0.14
    ëıħ
    0.14
     पद
    0.14
    Act Density 0.060%

    No Known Activations