INDEX
    Explanations

    baseball players

    Negative Logits
     چنان
    -0.08
     بیشتر
    -0.06
    [C
    -0.06
    _Anim
    -0.06
    	Item
    -0.06
    ilot
    -0.06
     Fully
    -0.06
    ######
    -0.06
    르고
    -0.06
    manız
    -0.06
    Positive Logits
     bug
    0.07
    adní
    0.07
    0.06
    ,len
    0.06
     lái
    0.06
     ört
    0.06
    emotion
    0.06
    0.06
    활동
    0.06
     относится
    0.06
    Activation Density: 0.002%

    No Known Activations