INDEX
    Explanations

    mathematical notation and functions, particularly those involving exponents and variables

    New Auto-Interp
    Negative Logits
     Orrell
    -1.04
     שוליים
    -1.02
     Jefus
    -0.99
     fubject
    -0.94
    WriteBarrier
    -0.93
     itſelf
    -0.93
     auffi
    -0.92
     parseFrom
    -0.92
     ProtoMessage
    -0.91
     pinulongan
    -0.90
    POSITIVE LOGITS
    ^{
    1.34
    }^{
    0.86
     Willoughby
    0.78
     ^{
    0.75
    __.
    0.69
    )^{
    0.68
    ))^{
    0.66
    LOGGER
    0.66
    })^{
    0.65
     Kyo
    0.65
    Act Density 0.134%

    No Known Activations