INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    .$-
    1.21
    .$.
    1.20
    $("
    1.15
    '$\
    1.08
    .​​
    1.07
    $('
    1.05
    NumberOperation
    1.03
    $.\
    1.02
    .\
    1.01
    )$.
    0.99
    POSITIVE LOGITS
    s
    1.34
     (>
    1.22
     
    1.21
     (
    1.11
     is
    1.09
     Highness
    1.09
     average
    1.05
    ف
    1.05
     хвати
    1.04
    is
    1.02
    Act Density 3.556%

    No Known Activations