INDEX
    Explanations

    various numerical representations

    New Auto-Interp
    Negative Logits
     nine
    1.64
     Nine
    1.50
    Nine
    1.42
    nine
    1.33
     nueve
    1.27
     eleven
    1.24
     nineties
    1.21
    1.18
     neun
    1.17
     ninth
    1.13
    POSITIVE LOGITS
    2
    1.49
    1
    1.33
    1.33
    1.32
    3
    1.23
     одну
    1.22
    1.21
     डेढ़
    1.18
     jedną
    1.18
     ۲
    1.14
    Act Density 0.415%

    No Known Activations