    Explanations

    addition of numbers and symbols

    Negative Logits
    Token                Logit
    "decreasing"         0.50
    "ominus"             0.41
    "ായിരുന്നു"          0.40
    "েবের"               0.40
    " $-"                0.40
    "Minus"              0.40
    (token not shown)    0.39
    "walled"             0.39
    "othelium"           0.38
    "काया"               0.38
    Positive Logits
    Token                Logit
    " +"                 2.02
    " plus"              1.65
    " плюс"              1.61
    "+"                  1.59
    "加上"               1.55
    " $+$"               1.50
    " ditambah"          1.49
    "()+"                1.45
    "再加上"             1.45
    (token not shown)    1.45
    Act Density 0.078%

    No Known Activations
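
For readers unfamiliar with the activation density figure above, the sketch below shows one way such a number could be computed. It assumes density means the percentage of token positions on which the feature's activation is above zero; the page does not state its exact definition, so treat that as an assumption. The function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and chosen only to reproduce a value of 0.078%.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of token positions whose activation exceeds `threshold`.

    Assumption: "density" is the share of positions where the feature fires at all.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical data: 100,000 token positions, 78 of which have a nonzero activation.
sample = [0.0] * 99_922 + [1.3] * 78
density = activation_density(sample)
print(f"{density:.3f}%")
```

With these made-up inputs the feature fires on 78 of 100,000 positions, which is the kind of sparsity a density of 0.078% would describe.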