INDEX
    Explanations

    articles or determiners, particularly focusing on the term "A" and its variations

    New Auto-Interp
    Negative Logits
     keju
    -0.45
    Liefs
    -0.43
    водства
    -0.43
    wapV
    -0.43
     Occidente
    -0.43
     antworte
    -0.42
    Bruh
    -0.42
    Partager
    -0.42
     Logistik
    -0.42
    ulemon
    -0.41
    POSITIVE LOGITS
     nonUne
    0.46
    ETHING
    0.43
    __*/
    0.41
    といけない
    0.40
    combination
    0.40
     chi̍t
    0.40
    intégr
    0.39
    0.39
    noy
    0.39
     كومونز
    0.39
    Act Density 0.441%

    No Known Activations