Explanations
words related to sports teams or organizations
references to specific sports teams
Negative Logits
ously     -0.83
ĺħ        -0.81
ibrary    -0.75
oscope    -0.71
ocene     -0.67
dams      -0.64
osc       -0.64
plague    -0.64
tumblr    -0.63
itures    -0.63
Positive Logits
mates     1.29
mate      1.07
liquid    1.04
sters     1.00
Liquid    0.97
Fortress  0.92
mates     0.91
Solo      0.87
Dign      0.87
Rocket    0.87
Activations Density: 0.055%