Explanations
phrases related to sports scores and performance metrics
Negative Logits
etsk        -0.16
nackte      -0.15
hoff        -0.15
Sab         -0.14
adelphia    -0.14
stran       -0.14
SSI         -0.14
Metro       -0.14
acula       -0.14
amar        -0.13
Positive Logits
zier        0.15
coe         0.15
dux         0.14
Laure       0.14
arge        0.14
384         0.14
chancellor  0.14
Emb         0.13
Leader      0.13
/error      0.13
Activations Density: 0.038%