INDEX
Explanations
mentions of specific college teams and players in a sports context
Negative Logits
Token       Logit
aska        -0.19
RLF         -0.15
quist       -0.15
ennon       -0.14
俊          -0.14
agn         -0.14
utenberg    -0.14
ACHE        -0.14
омер        -0.14
uator       -0.14
Positive Logits
Token       Logit
Duke        0.42
duke        0.30
Durham      0.30
duk         0.24
DU          0.24
Dur         0.24
DU          0.23
duk         0.22
敷          0.22
Blue        0.21
Activations Density 0.007%