    Explanations

    mathematical expressions and notation

    Mathematical or arithmetic symbols

    mathematical operators + -

    Negative Logits
     comparison
    -0.57
    ')):
    -0.57
    '])->
    -0.56
     שוליים
    -0.53
    )();
    -0.52
    égard
    -0.51
    )');
    -0.51
     compare
    -0.50
    ))^{
    -0.50
    ")));
    
    -0.50
    Positive Logits
     +
    1.88
     plus
    1.65
    +
    1.58
     $+$
    1.51
    1.44
     плюс
    1.43
     PLUS
    1.40
     $+
    1.39
     + 
    1.34
    }+
    1.33
    Act Density 1.815%

    No Known Activations