    Negative Logits
    Token                 Logit
    (token not shown)     0.42
    "แส"                  0.41
    " جبر"                0.40
    (token not shown)     0.39
    "长安"                0.39
    " সম্ভব"               0.38
    " możli"              0.38
    " बल्कि"               0.37
    "ོང་"                  0.37
    " ممکن"               0.36
    Positive Logits
    Token                 Logit
    " soft"               1.27
    " Soft"               1.16
    "Soft"                1.13
    " softer"             1.12
    "soft"                1.10
    (token not shown)     1.09
    (token not shown)     1.01
    " SOFT"               1.00
    " softness"           0.95
    (token not shown)     0.90
    Activation Density: 0.016%

    No Known Activations