INDEX
    Explanations

    definite articles and pronouns in Spanish

    New Auto-Interp
    Negative Logits
    -0.49
    øse
    -0.40
    jalá
    -0.40
    yanto
    -0.40
    US
    -0.40
     chrétien
    -0.39
    5
    -0.39
    dirond
    -0.38
    kjø
    -0.37
    ñado
    -0.36
    POSITIVE LOGITS
     Administrativna
    0.81
     the
    0.79
     την
    0.75
     surla
    0.73
     la
    0.73
     Infórmanos
    0.71
    حياتها
    0.69
    Parcelize
    0.68
     المعيارى
    0.68
    AxisAlignment
    0.65
    Act Density 0.010%

    No Known Activations