INDEX
    Explanations

    articles and possessive pronouns

    New Auto-Interp
    Negative Logits
     Roskov
    -0.86
     AssemblyCulture
    -0.79
    ItemLayout
    -0.76
    ьаж
    -0.75
     Hemsworth
    -0.75
    reibt
    -0.74
    stalgia
    -0.70
     onPostExecute
    -0.68
     apellidos
    -0.67
     rêver
    -0.67
    POSITIVE LOGITS
     der
    1.54
    Der
    1.17
     dieser
    1.09
     Der
    1.04
    Die
    1.00
     ihrer
    1.00
    der
    0.99
     seiner
    0.97
     die
    0.97
     DER
    0.95
    Act Density 0.016%

    No Known Activations