INDEX
    Explanations

    references to college sports teams, particularly basketball and football

    New Auto-Interp
    Negative Logits
     çĶŁåij½åij¨æľŁåĩ½æķ°
    -0.15
    ks
    -0.15
    496
    -0.14
    仲
    -0.14
    .Prot
    -0.14
    ucher
    -0.14
    oq
    -0.14
     Abb
    -0.14
    ismet
    -0.13
     leagues
    -0.13
    POSITIVE LOGITS
    /link
    0.17
     Partial
    0.15
     Experiment
    0.14
    CF
    0.14
    lap
    0.14
    ycle
    0.14
    Partial
    0.14
    cken
    0.14
    GT
    0.13
    arez
    0.13
    Act Density 0.039%

    No Known Activations