INDEX
Explanations
references to "Big" followed by numbers, indicating a focus on sports conference names
New Auto-Interp
Negative Logits
istry
-0.84
idency
-0.78
confir
-0.77
theless
-0.73
fully
-0.69
yrim
-0.68
muster
-0.67
ILA
-0.67
cia
-0.67
lessness
-0.66
POSITIVE LOGITS
gest
1.46
ger
1.24
gie
1.05
Brother
0.99
gins
0.93
gers
0.90
wig
0.89
Daddy
0.87
Bang
0.86
glers
0.85
Activations Density 0.018%