INDEX
    Explanations

    LaTeX formatting elements in mathematical expressions

    Negative Logits
    Token       Logit
    "出品"      -0.16
    "urch"      -0.16
    "isoft"     -0.16
    "ought"     -0.16
    "chair"     -0.14
    "aged"      -0.14
    "řiv"       -0.14
    "ere"       -0.14
    "iso"       -0.14
    "itr"       -0.14
    Positive Logits
    Token           Logit
    "equ"           0.27
    "align"         0.26
    " equ"          0.25
    " align"        0.23
    " Align"        0.21
    "IEEE"          0.21
    " equation"     0.20
    " alignment"    0.20
    " aligned"      0.20
    "eq"            0.20
    Activation Density: 0.045%

    No Known Activations