INDEX
    Explanations

    sports teams

    New Auto-Interp
    Negative Logits
     Ajax
    -0.07
     fashioned
    -0.07
    bottom
    -0.07
     Coil
    -0.07
    -0.07
    unsupported
    -0.07
     rootReducer
    -0.07
    -0.07
    إصدار
    -0.06
    -hours
    -0.06
    POSITIVE LOGITS
    /Y
    0.07
     harassing
    0.07
    0.06
     trafficking
    0.06
    flate
    0.06
    OTAL
    0.06
    简单
    0.06
    bet
    0.06
    甜甜
    0.06
    的事情
    0.06
    Act Density 0.020%

    No Known Activations