INDEX
    Explanations

    references to sports teams and their performances

    New Auto-Interp
    Negative Logits
    robat
    -0.17
    .sav
    -0.17
     thái
    -0.15
     Shank
    -0.15
    ignet
    -0.15
    successfully
    -0.15
    åĨł
    -0.14
    gnore
    -0.14
    IMA
    -0.14
    زاÙĨ
    -0.14
    POSITIVE LOGITS
     fal
    0.32
     struggle
    0.30
     struggles
    0.28
     succ
    0.27
     lim
    0.27
     fail
    0.24
     struggled
    0.24
     suffer
    0.23
     struggling
    0.23
     lost
    0.23
    Act Density 0.117%

    No Known Activations