    Negative Logits
    AspNet        -0.07
     within       -0.07
     ощ           -0.07
    ênh           -0.07
     امنیت        -0.07
     sing         -0.07
    _ENUM         -0.07
     Cluster      -0.06
    ("^           -0.06
    něž           -0.06

    Positive Logits
     made          0.14
     Made          0.10
    made           0.10
    -made          0.09
    Made           0.08
     MADE          0.07
     backyard      0.07
    ade            0.07
     poking        0.07
    de             0.06

    Activation Density: 0.016%

    No Known Activations