INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    -0.07
    	yy
    -0.07
     الفكر
    -0.07
    ('>
    -0.07
    :UIControlState
    -0.07
     insistence
    -0.07
    normalize
    -0.07
     balance
    -0.07
    ropping
    -0.06
    -0.06
    POSITIVE LOGITS
    Edges
    0.08
     defiant
    0.07
    _ACTIONS
    0.07
     Lau
    0.07
     mest
    0.07
     zeroes
    0.07
    تأكد
    0.07
    Our
    0.07
    _COM
    0.07
    0.07
    Act Density 0.046%

    No Known Activations