INDEX
    Explanations

    references to specific sports teams and their performance

    New Auto-Interp
    Negative Logits
    undi
    -0.15
    }elseif
    -0.15
    icho
    -0.14
    rchive
    -0.14
    apsed
    -0.14
    contro
    -0.14
    ÙĪÙĦÙĩ
    -0.14
    elsey
    -0.14
    ãģ²ãģ¨
    -0.13
     Sands
    -0.13
    POSITIVE LOGITS
    ettes
    0.23
     Daw
    0.21
    hawks
    0.20
     faithful
    0.20
    cats
    0.19
    birds
    0.19
    inals
    0.18
    men
    0.17
     Dogs
    0.16
    mps
    0.16
    Act Density 0.044%

    No Known Activations