INDEX
Explanations
references to sports teams and their performance metrics
New Auto-Interp
Negative Logits
念
-0.16
çīĻ
-0.15
Charlottesville
-0.15
.bt
-0.14
Sir
-0.14
ñana
-0.14
Sir
-0.14
stadium
-0.14
ÙĨÛĮÙĨ
-0.14
zte
-0.14
POSITIVE LOGITS
NA
0.22
NA
0.19
Naz
0.17
WSC
0.17
Division
0.17
Division
0.16
GN
0.16
avra
0.16
oplayer
0.15
uja
0.15
Activations Density 0.038%