INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    bbs
    -0.08
     Bubble
    -0.07
     diter
    -0.07
    -0.07
     прост
    -0.07
     pojed
    -0.07
    ibon
    -0.07
    -0.07
    _apps
    -0.07
    ાત્મક
    -0.07
    POSITIVE LOGITS
    0.12
     श्रेष्ठ
    0.11
    0.11
     inferior
    0.10
    Compared
    0.10
     northeast
    0.10
     ఎక్కువ
    0.10
     نسبت
    0.10
    iore
    0.09
     superiority
    0.09
    Act Density 0.049%

    No Known Activations