INDEX
    Explanations

    mathematical expressions and notation, particularly involving derivatives and functions

    New Auto-Interp
    Negative Logits
    Према
    -0.67
    achella
    -0.65
     whoſe
    -0.65
     Ond
    -0.65
     ſmall
    -0.64
     Verſ
    -0.64
     Theſe
    -0.63
     themſelves
    -0.63
     leaſt
    -0.62
     tromper
    -0.62
    POSITIVE LOGITS
    ^{-
    1.10
    )^{-
    1.08
    }^{-
    1.03
     }^{-
    0.92
    ]^{-
    0.81
    ^{-\
    0.76
     ^{-
    0.66
    辞典
    0.63
    ^-
    0.62
     שוליים
    0.62
    Act Density 0.324%

    No Known Activations