INDEX
    Explanations

    acronyms, foreign scripts, programming, lists

    New Auto-Interp
    Negative Logits
    underwear
    0.44
     Extensions
    0.43
    uance
    0.43
    typically
    0.42
     imasmim
    0.41
     sourceL
    0.40
     ಇರುವ
    0.40
     extensions
    0.40
    0.40
     ఉండ
    0.39
    POSITIVE LOGITS
     bravely
    0.47
    ద్య
    0.43
    φέρον
    0.42
    ю
    0.42
     averted
    0.41
     gave
    0.41
    ερ
    0.41
     প্রতিষ্ঠার
    0.41
     failed
    0.40
    ર્
    0.40
    Act Density 0.002%

    No Known Activations