INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    -addons
    -0.16
    isman
    -0.15
    RowAt
    -0.15
    loser
    -0.14
    .contentSize
    -0.14
    edic
    -0.14
     Ùħع
    -0.13
    registr
    -0.13
     Chick
    -0.13
    ượng
    -0.13
    POSITIVE LOGITS
     class
    0.50
    class
    0.37
     className
    0.34
    	class
    0.34
     Class
    0.31
    -class
    0.28
     клаÑģÑģ
    0.28
    _class
    0.28
    Class
    0.27
    (class
    0.27
    Act Density 0.022%

    No Known Activations