INDEX
    Explanations

    references to the quantity and involvement of individuals or groups in various contexts

    New Auto-Interp
    Negative Logits
     mostly
    -0.19
    iversit
    -0.17
     always
    -0.16
    ensa
    -0.16
     tất
    -0.15
     siempre
    -0.15
    mostly
    -0.15
     Mostly
    -0.15
     vždy
    -0.14
    always
    -0.14
    POSITIVE LOGITS
     simply
    0.20
     Simply
    0.18
    Simply
    0.17
    348
    0.16
    -times
    0.15
    gree
    0.15
    arda
    0.14
     simplement
    0.14
    ogg
    0.14
     already
    0.14
    Act Density 0.138%

    No Known Activations