INDEX
    Negative Logits
    Token             Logit
    cabul             -0.78
    Lycka             -0.70
    OTROS             -0.69
    Üdv               -0.67
     squ              -0.65
     initWithFrame    -0.64
    فایل‌لار           -0.64
    mydb              -0.63
     deb              -0.62
    imbawa            -0.62
    Positive Logits
    Token             Logit
    .:                1.23
    :                 1.16
    []):              1.15
    __":              1.13
     :                1.13
    _:                1.09
    *:                1.08
    ???:              1.08
    ✨:                1.08
    __':              1.07
    Activation Density: 0.242%

    No Known Activations