INDEX
    Explanations

    approximations and ranges

    New Auto-Interp
    Negative Logits
     ಹಣ
    0.26
    较低
    0.25
    eseorang
    0.24
     suatu
    0.24
    ‌پ
    0.23
     $*$-
    0.23
     വൈദ്യുതി
    0.23
     erfolgen
    0.23
    <unused217>
    0.23
    <unused2051>
    0.23
    POSITIVE LOGITS
     approximately
    0.67
     roughly
    0.57
     about
    0.55
     approx
    0.54
     ~
    0.54
     around
    0.52
     yaklaşık
    0.52
    approximately
    0.49
    0.49
    大约
    0.47
    Act Density 0.251%

    No Known Activations