INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     mismo
    -0.10
     misma
    -0.10
     mateixa
    -0.10
    same
    -0.10
     mesma
    -0.10
     hoʻi
    -0.09
     stessi
    -0.09
     stessa
    -0.09
     mesmo
    -0.09
    _same
    -0.09
    POSITIVE LOGITS
     ident
    0.22
     иден
    0.21
    ident
    0.20
     Ident
    0.18
    _ident
    0.17
    Ident
    0.17
    -ident
    0.17
    'ident
    0.16
    .ident
    0.16
     the
    0.15
    Act Density 0.093%

    No Known Activations