INDEX
    Explanations

    references to government officials and their actions

    New Auto-Interp
    Negative Logits
    fall
    -0.16
    ç³
    -0.15
    maid
    -0.15
    /div
    -0.14
    ingly
    -0.14
    pective
    -0.14
    AMESPACE
    -0.14
    imu
    -0.14
    erer
    -0.13
    osa
    -0.13
    POSITIVE LOGITS
    844
    0.17
    ÙĤات
    0.17
    hower
    0.15
    dom
    0.15
    enthal
    0.15
    /admin
    0.14
    isex
    0.14
    Äĥ
    0.14
    itaire
    0.14
    owl
    0.14
    Act Density 0.024%

    No Known Activations