INDEX
    Explanations

    mathematical equations and expressions

    New Auto-Interp
    Negative Logits
     اÙĦطب
    -0.07
    erva
    -0.07
    borg
    -0.07
    opi
    -0.07
    uela
    -0.07
    gnu
    -0.07
    اÙģÛĮ
    -0.07
     submar
    -0.07
    erver
    -0.06
    jac
    -0.06
    POSITIVE LOGITS
     halves
    0.08
     half
    0.06
     two
    0.06
    pike
    0.06
    kee
    0.06
     twice
    0.06
    ori
    0.06
     pl
    0.06
    half
    0.05
     midpoint
    0.05
    Act Density 0.045%

    No Known Activations