INDEX
    Explanations

    references to institutions and their influence in political contexts

    New Auto-Interp
    Negative Logits
     Swinger
    -0.16
    ingu
    -0.16
    ander
    -0.15
    ensa
    -0.15
     pha
    -0.15
    ecycle
    -0.15
    &apos
    -0.14
    428
    -0.14
    TODO
    -0.14
     paperwork
    -0.14
    POSITIVE LOGITS
    itom
    0.16
    wj
    0.15
     pronto
    0.14
    ož
    0.14
     Dit
    0.14
    ipi
    0.13
     nada
    0.13
    och
    0.13
    ãģĸ
    0.13
    vature
    0.13
    Act Density 0.000%

    No Known Activations