INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     geboren
    0.53
     birthplace
    0.48
    出生
    0.46
     kelahiran
    0.45
     born
    0.44
     lahir
    0.42
    рожден
    0.41
     ANGELES
    0.39
     जन्मे
    0.39
    °-
    0.39
    POSITIVE LOGITS
    จัก
    0.41
     অন্তত
    0.40
     nesse
    0.39
    0.38
    ksjon
    0.37
     tudi
    0.37
     ያል
    0.37
    passen
    0.37
     ไล
    0.37
     భారీ
    0.36
    Act Density 0.001%

    No Known Activations