INDEX
    Explanations

    references to government, regulations, and orders related to social structures and policies

    New Auto-Interp
    Negative Logits
    AWN
    -0.17
    awn
    -0.15
    odash
    -0.14
    ober
    -0.14
    ì£ł
    -0.14
    chez
    -0.14
    aeda
    -0.14
     quá
    -0.14
    ojis
    -0.14
    ÑģÑĤи
    -0.13
    POSITIVE LOGITS
     finally
    0.24
    finally
    0.21
     either
    0.19
     eventually
    0.19
    both
    0.18
     both
    0.18
     somehow
    0.17
     tore
    0.17
     simplement
    0.17
     simply
    0.16
    Act Density 0.246%

    No Known Activations