INDEX
    Explanations

    mathematical expressions and formulas involving variables and symbols

    Parentheses followed by specific characters

    mathematical and code expressions

    New Auto-Interp
    Negative Logits
    ((
    -0.81
    rungsseite
    -0.80
    (
    -0.80
    $
    -0.75
     فريبيس
    -0.70
     незавершена
    -0.68
     kasarigan
    -0.67
    $("#
    -0.65
     ***!
    -0.64
    $\
    -0.61
    POSITIVE LOGITS
    !!!)
    0.83
    !!)
    0.81
    ..)
    0.77
    ??)
    0.74
    ?),
    0.73
    ,)
    0.71
    --)
    0.70
    ....)
    0.70
    ++)
    0.68
    ?).
    0.65
    Act Density 0.789%

    No Known Activations