INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     khó
    -0.08
     ngân
    -0.08
     hadda
    -0.08
    oader
    -0.08
    65
    -0.08
     יכולה
    -0.08
    ოლოგ
    -0.08
    seven
    -0.08
    -0.08
    ẩn
    -0.08
    POSITIVE LOGITS
     Sorgen
    0.07
     resto
    0.07
     restante
    0.07
    Parte
    0.07
     parti
    0.07
    .calls
    0.07
     estén
    0.07
     psic
    0.07
    oron
    0.07
    nya
    0.07
    Act Density 0.027%

    No Known Activations