INDEX
    Explanations

    references to administrative processes and requirements in a bureaucratic context

    New Auto-Interp
    Negative Logits
    ahn
    -0.17
    ault
    -0.15
    nton
    -0.14
    lijah
    -0.14
     Lit
    -0.14
    CPF
    -0.14
    atrice
    -0.13
     bomb
    -0.13
     pie
    -0.13
    142
    -0.13
    POSITIVE LOGITS
     instead
    0.19
     Instead
    0.17
    instead
    0.16
    ÏĦιÏĥ
    0.15
    agara
    0.15
     вмеÑģÑĤ
    0.15
    ola
    0.15
    vál
    0.14
    iche
    0.14
    iment
    0.14
    Act Density 0.180%

    No Known Activations