    Negative Logits
    Token            Logit
    "gab"            -0.08
    " Dl"            -0.08
    (not shown)      -0.08
    "одаря"          -0.08
    "726"            -0.07
    " gab"           -0.07
    " повышения"     -0.07
    "psilon"         -0.07
    "\tff"           -0.07
    "EPA"            -0.07
    Positive Logits
    Token            Logit
    " पुर"            0.08
    " victim"        0.08
    " escl"          0.07
    " sympat"        0.07
    " dread"         0.07
    "Vict"           0.07
    " fib"           0.07
    " SCC"           0.07
    " victims"       0.07
    " slender"       0.07
    Activation Density: 0.001%

    No Known Activations