Explanations
phrases related to scores and leading in sports contexts
Negative Logits
ãĥĨãĥ«    -0.18
ÃŃd       -0.17
iesel     -0.16
Ben       -0.16
üns       -0.15
nuts      -0.15
hei       -0.15
acomp     -0.15
remotely  -0.14
ollapsed  -0.14
Positive Logits
ional         0.15
xFFF          0.15
aker          0.15
onal          0.14
oir           0.14
èĩªåĬ¨çĶŁæĪIJ  0.14
EAR           0.14
kö            0.14
ANGO          0.14
xfff          0.14
Activations Density 0.036%