INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    geries
    -0.52
    lag
    -0.52
     liên
    -0.52
    ghted
    -0.51
     Ry
    -0.51
     sobre
    -0.48
    i
    -0.48
    bor
    -0.48
    داً
    -0.48
    ctuary
    -0.47
    POSITIVE LOGITS
     parents
    2.87
     Parents
    2.57
    Parents
    2.48
     PARENTS
    2.40
    parents
    2.38
     genitori
    1.92
     ouders
    1.88
     Eltern
    1.87
     padres
    1.69
     родители
    1.65
    Act Density 0.058%

    No Known Activations