INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     onchange
    -0.07
     Hill
    -0.07
     cartesian
    -0.07
     عملکرد
    -0.07
     Parse
    -0.07
     illusion
    -0.06
     ellipse
    -0.06
     Constant
    -0.06
    _bound
    -0.06
     Dice
    -0.06
    POSITIVE LOGITS
     sponsor
    0.10
     Sponsor
    0.09
     sponsoring
    0.08
    0.08
     sponsorship
    0.08
    sponsor
    0.07
    0.07
    0.07
     sponsored
    0.07
    0.07
    Act Density 0.007%

    No Known Activations