INDEX
    Explanations

    articles or determiners in various forms

    New Auto-Interp
    Negative Logits
     poussière
    -0.77
     fumée
    -0.77
     espagne
    -0.71
     Gild
    -0.69
     armée
    -0.69
     charité
    -0.69
     nationaux
    -0.68
     économie
    -0.68
     survie
    -0.67
    ásban
    -0.66
    POSITIVE LOGITS
     a
    1.63
     une
    1.24
     una
    1.24
     eine
    1.23
    Eine
    1.20
     einer
    1.19
     μια
    1.18
     an
    1.15
     uma
    1.13
     một
    1.13
    Act Density 0.017%

    No Known Activations