    Explanations

    mathematical notation and formulas

    Negative Logits
    Token         Value
    " раз"        0.89
    "ו"           0.86
    "з"           0.85
    " ממש"        0.83
    "ه"           0.81
    " منشور"      0.80
                  0.79
    " народов"    0.77
    " IE"         0.77
    " ه"          0.76
    Positive Logits
    Token         Value
    "mathbb"      1.28
    "text"        1.27
    "tilde"       1.26
    "frac"        1.24
    "overline"    1.24
    "sqrt"        1.21
    "textrm"      1.18
    "ldots"       1.17
    "mu"          1.15
    "rightarrow"  1.15
    Activation Density: 0.047%
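
    A density figure like the one above is commonly read as the fraction of token positions on which a feature fires. The sketch below illustrates that reading only; the function name `activation_density`, the zero threshold, and the sample counts are all hypothetical and are not taken from this page or from any particular dashboard's implementation.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means (positions where the feature fires) / (all positions).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical sample: the feature fires on 47 of 100,000 token positions.
sample = [1.0] * 47 + [0.0] * 99_953
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.047%
```

    Under this reading, a density of 0.047% corresponds to roughly 47 firing positions per 100,000 tokens.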

    No Known Activations