    Explanations

    mathematical equations and notation

    Negative Logits
    Token        Logit
    " Base"      -0.21
    " Body"      -0.20
    "(Base"      -0.19
    "Base"       -0.19
    " Brick"     -0.19
    " Beh"       -0.18
    " Branch"    -0.18
    " Bel"       -0.18
    "Body"       -0.18
    " Block"     -0.17
    Positive Logits
    Token        Logit
    "-b"         0.79
    "³b"         0.72
    "\tb"        0.64
    "+b"         0.64
    "*b"         0.63
    "=b"         0.63
    "/b"         0.62
    ".b"         0.61
    ",b"         0.60
    ":b"         0.60
    Act Density 1.017%

    No Known Activations