    Explanations

    mathematical notation indicating functions or operators

    Negative Logits
    Token             Logit
    "5"               -0.58
    "3"               -0.56
    "4"               -0.56
    "<em>"            -0.56
    "1"               -0.55
    "8"               -0.53
    "7"               -0.53
    "</blockquote>"   -0.53
    "2"               -0.52
    "9"               -0.49
    Positive Logits
    Token             Logit
    "right"           1.45
    "RIGHT"           1.00
    " right"          0.86
    "Right"           0.80
    " Right"          0.76
    "righ"            0.75
    " RIGHT"          0.75
    "bigr"            0.73
    " ویکی‌پدیای"      0.71
    "iastes"          0.70
    Activation Density: 0.054%

    No Known Activations