INDEX
    Explanations

    bright smile, confidence

    New Auto-Interp
    Negative Logits
    א
    0.80
    ι
    0.70
    0.70
    ol
    0.70
    نا
    0.66
    ip
    0.64
    0.63
    ونية
    0.61
    ام
    0.60
    ুর
    0.60
    POSITIVE LOGITS
    0
    0.80
     bebas
    0.66
     exiled
    0.64
     americ
    0.63
     you
    0.62
    zelf
    0.61
     novoProduto
    0.60
     виправи
    0.59
     엔진
    0.59
     amerikan
    0.59
    Act Density 0.001%

    No Known Activations