INDEX
    Explanations

    mathematical expressions with fractions and square roots

    Negative Logits
    Token     Logit
              0.94
              0.93
     🔥       0.90
              0.89
              0.88
     loves    0.84
              0.84
              0.83
     *_       0.82
              0.81
    Positive Logits
    Token    Logit
    {        2.34
    {\       1.72
    {-       1.59
    {(       1.53
    {(\      1.34
    {{\      1.29
    {}{      1.26
    {[       1.24
    {-\      1.22
    {&       1.19
    Act Density 0.183%

    No Known Activations