INDEX
    Explanations

    mathematical symbols and expressions, particularly equalities and inequalities

    New Auto-Interp
    Negative Logits
     utafitiHapana
    -0.85
     Chwiliwch
    -0.68
    tagHelperRunner
    -0.65
     ſever
    -0.64
    тельству
    -0.61
     ویکی‌پدیای
    -0.59
     beſ
    -0.59
    ſelf
    -0.59
     vocês
    -0.58
    -0.58
    POSITIVE LOGITS
    >=</
    1.45
    }=
    1.25
     }}=\
    1.19
    }=\
    1.19
    ']=
    1.17
     $=\
    1.17
    /=
    1.15
     $=
    1.14
    =\
    1.13
     }}=
    1.10
    Act Density 0.523%

    No Known Activations