INDEX
Explanations
locations or entities related to sports teams
proper nouns related to specific teams, locations, and individuals
Negative Logits
ugal        -0.77
initely     -0.75
isting      -0.73
axter       -0.72
ista        -0.68
Emerson     -0.68
hunger      -0.68
istor       -0.67
ando        -0.66
ists        -0.66
Positive Logits
enegger     0.90
creen       0.79
DAY         0.73
lus         0.71
Boll        0.71
kus         0.67
orage       0.67
mber        0.66
Squirrel    0.66
vine        0.66
Activations Density 0.048%