INDEX
Explanations
mentions of specific sports teams
New Auto-Interp
Negative Logits
catentry
-0.82
REDACTED
-0.67
ascript
-0.67
Predator
-0.66
Ö¼
-0.66
UCT
-0.66
PRES
-0.60
DRAG
-0.59
priceless
-0.59
ãĥīãĥ©
-0.57
POSITIVE LOGITS
puff
0.88
ĵĺ
0.84
itzer
0.81
coe
0.80
erville
0.79
emouth
0.79
inki
0.78
erton
0.73
leck
0.71
reau
0.69
Activations Density 5.200%