INDEX
    Explanations

    references to programs, approvals, and debates

    New Auto-Interp
    Negative Logits
     Wel
    -0.16
    kos
    -0.15
    çħ
    -0.15
    ľ
    -0.15
     Corinth
    -0.15
    inki
    -0.14
    ÑıÑĤÑĮ
    -0.14
     Pis
    -0.13
     sil
    -0.13
    etable
    -0.13
    POSITIVE LOGITS
    compound
    0.16
    ÃĹ↵↵
    0.15
    ,eg
    0.15
    ibold
    0.15
    aepernick
    0.15
    427
    0.15
    assa
    0.15
    RequiredMixin
    0.15
    erdem
    0.15
    unami
    0.15
    Act Density 0.034%

    No Known Activations