Explanations
references to a specific sports team named "Lions"
mentions of the Detroit Lions football team
Negative Logits
mble          -0.95
lly           -0.93
elsius        -0.87
ntil          -0.86
srf           -0.84
DonaldTrump   -0.79
lying         -0.79
Seym          -0.78
nces          -0.77
aleigh        -0.76
Positive Logits
Lions         1.37
Tigers        0.96
Pistons       0.83
lions         0.82
Clubs         0.81
Packers       0.81
Wolves        0.80
Bears         0.78
Beasts        0.76
Cowboys       0.75
Activations Density 0.009%