INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     
    0.63
    ون
    0.60
    of
    0.58
    ي
    0.58
    ur
    0.57
    al
    0.56
    ä
    0.55
    og
    0.53
    ir
    0.52
    one
    0.52
    POSITIVE LOGITS
     attuale
    0.70
    varande
    0.63
    0.61
    adays
    0.60
    ne
    0.57
     inmedi
    0.54
    ချိန်
    0.54
    0.53
    いた
    0.53
     현재
    0.53
    Act Density 0.081%

    No Known Activations