INDEX
    Explanations

    mathematical expressions and functions

    New Auto-Interp
    Negative Logits
     BI
    -0.36
    BOT
    -0.36
    BI
    -0.35
     Borne
    -0.34
    Bru
    -0.34
    quias
    -0.33
    TestingModule
    -0.33
    BAN
    -0.33
    obenz
    -0.33
    SpringRunner
    -0.33
    POSITIVE LOGITS
     b
    2.94
    b
    2.53
     б
    1.49
    1.25
    б
    1.13
     ب
    1.03
    𝑏
    0.99
    0.94
    ب
    0.89
    bB
    0.85
    Act Density 1.397%

    No Known Activations