INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    ÓS
    -1.09
    zonych
    -1.02
    assertAll
    -1.02
     povezave
    -0.99
    struzioni
    -0.98
    éal
    -0.98
     diejenigen
    -0.94
    влению
    -0.91
    şa
    -0.90
    ínsula
    -0.90
    POSITIVE LOGITS
     same
    10.06
    same
    7.31
    Same
    7.22
    同じ
    6.19
     mesma
    6.00
     Same
    5.91
     mismo
    5.81
     aynı
    5.81
     misma
    5.41
     SAME
    5.34
    Act Density 0.996%

    No Known Activations