INDEX
    Explanations

    articles indicating singular and plural nouns

    New Auto-Interp
    Negative Logits
     Mona
    -0.17
     Tone
    -0.15
     (~(
    -0.14
    uos
    -0.14
     imper
    -0.13
     Gast
    -0.13
     bos
    -0.13
    .xz
    -0.13
     mole
    -0.13
     tone
    -0.13
    POSITIVE LOGITS
    lsen
    0.17
    ucha
    0.16
    ẩm
    0.15
    iliar
    0.15
     Levy
    0.15
    .scalar
    0.15
     anale
    0.15
    lparr
    0.15
     Fukushima
    0.14
    alement
    0.14
    Act Density 0.026%

    No Known Activations