INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     आण
    -0.09
     mengambil
    -0.08
     banho
    -0.08
     bath
    -0.08
    -0.08
     सम्प
    -0.08
     वर्तमान
    -0.07
     अप
    -0.07
     Magn
    -0.07
     integrada
    -0.07
    POSITIVE LOGITS
    unexpected
    0.10
    Unexpected
    0.09
    Fizz
    0.08
    ิติ
    0.08
    ეთ
    0.08
     schenken
    0.08
     goodwill
    0.08
    ?!
    0.07
     motif
    0.07
     unexpected
    0.07
    Act Density 0.010%

    No Known Activations