Explanations
phrases related to sports rules and controversies
Negative Logits
Token           Logit
imler           -0.15
STD             -0.15
Pierce          -0.15
-sur            -0.15
demonstration   -0.15
лександ         -0.14
заст            -0.14
kus             -0.14
input           -0.13
.tw             -0.13
Positive Logits
Token           Logit
852             0.16
azzi            0.15
chai            0.15
zn              0.15
уля             0.15
qui             0.14
ellen           0.14
審              0.14
ンチ            0.14
911             0.14
Activations Density 0.348%