INDEX
    Explanations

    occurrences of mathematical symbols and formatting elements that indicate mathematical notation

    New Auto-Interp
    Negative Logits
    eva
    -0.16
    rou
    -0.16
    horia
    -0.16
    oser
    -0.15
    quip
    -0.15
     abstract
    -0.15
    Sharp
    -0.14
    ä¾
    -0.14
     tip
    -0.14
     Lars
    -0.14
    POSITIVE LOGITS
    imes
    0.37
    extr
    0.23
    entimes
    0.20
     times
    0.20
    anh
    0.18
    imest
    0.18
    _times
    0.17
    ies
    0.17
    ims
    0.16
    ines
    0.16
    Act Density 0.007%

    No Known Activations