INDEX
    Explanations

    references to sports teams, particularly the Lakers and Knicks

    mentions of professional basketball teams, specifically the Lakers and Knicks

    Negative Logits
    Token           Logit
    "lying"         -0.85
    "nels"          -0.73
    "lly"           -0.67
    "autical"       -0.63
    "ravings"       -0.63
    "ablishment"    -0.63
    "lishes"        -0.62
    "umbn"          -0.61
    "schild"        -0.61
    "remlin"        -0.60
    Positive Logits
    Token           Logit
    " Lakers"       1.05
    " Clippers"     0.85
    " Bryant"       0.82
    "orian"         0.80
    " Hots"         0.78
    " Haram"        0.70
    " Basketball"   0.70
    "Lens"          0.70
    "Zone"          0.69
    " basketball"   0.69
    Activation Density: 0.008%

    No Known Activations