INDEX
    Explanations

    references to a specific sports team named "Lions"

    mentions of the Detroit Lions football team

    Negative Logits
    "mble"           -0.95
    "lly"            -0.93
    "elsius"         -0.87
    "ntil"           -0.86
    " srf"           -0.84
    "DonaldTrump"    -0.79
    "lying"          -0.79
    " Seym"          -0.78
    "nces"           -0.77
    "aleigh"         -0.76
    Positive Logits
    " Lions"         1.37
    " Tigers"        0.96
    " Pistons"       0.83
    " lions"         0.82
    " Clubs"         0.81
    " Packers"       0.81
    " Wolves"        0.80
    " Bears"         0.78
    " Beasts"        0.76
    " Cowboys"       0.75
    Activation Density: 0.009%

    No Known Activations