INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    mybatisplus
    -0.94
     Paglinawan
    -0.87
    findpost
    -0.82
     betweenstory
    -0.77
     للمعارف
    -0.72
    //
    -0.69
    تقاوى
    -0.68
     مشين
    -0.68
    expandindo
    -0.67
     ویکی‌پدیا
    -0.66
    POSITIVE LOGITS
     mod
    0.44
    -${
    0.41
     $\$
    0.40
     hex
    0.39
    .${
    0.39
    vos
    0.38
     nec
    0.38
     simp
    0.38
    fis
    0.37
     condu
    0.37
    Act Density 0.015%

    No Known Activations