Explanations
references to sports teams and their performances
Negative Logits
Token        Logit
Accessory    -0.85
ariat        -0.77
Tablet       -0.75
bush         -0.72
ously        -0.71
dit          -0.71
tar          -0.70
ibrary       -0.70
Delivery     -0.69
tein         -0.65
Positive Logits
Token        Logit
peak         1.11
heet         0.98
vying        0.90
etter        0.86
competing    0.86
ubs          0.84
ettings      0.81
eries        0.79
ete          0.77
compete      0.75
Activations Density 0.034%
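
As a rough illustration of what a density figure like this could mean, the sketch below assumes that density is the fraction of token positions on which the feature's activation is above zero. That definition, the function name activation_density, the threshold parameter, and the sample data are all assumptions made for illustration; the page itself does not say how the number was computed.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of token positions on which the
    feature fires at all. The source page does not confirm this definition.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 100,000 token positions, with the feature firing on 34.
acts = [0.0] * 100_000
for i in range(0, 100_000, 3_000):  # 34 evenly spaced positions
    acts[i] = 1.5

density = activation_density(acts)
print(f"{density:.3%}")  # prints 0.034%
```

Under this reading, a density of 0.034% would mean the feature is active on roughly 34 of every 100,000 tokens sampled.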