INDEX
    Explanations

    articles and determiners in the German language

    New Auto-Interp
    Negative Logits
    principalColumn
    -0.63
     argint
    -0.56
     étoit
    -0.54
    TagMode
    -0.54
    føl
    -0.53
     steder
    -0.52
     verticales
    -0.52
     rând
    -0.52
     calitate
    -0.51
     übrigen
    -0.51
    POSITIVE LOGITS
     een
    2.09
     eine
    1.93
     một
    1.72
     einen
    1.64
     isang
    1.62
    Een
    1.60
     einer
    1.57
    eine
    1.46
    Eine
    1.46
     Een
    1.45
    Act Density 0.035%

    No Known Activations