Explanations
mentions of coaching or hiring in sports contexts
NEGATIVE LOGITS
undan         -0.17
dyby          -0.15
маÑħ          -0.14
áy            -0.14
uced          -0.13
ã썿ĢĿãģĨ      -0.13
ç±            -0.13
odian         -0.13
Heck          -0.12
ox            -0.12
POSITIVE LOGITS
joins         0.27
replaces      0.26
succeeds      0.24
previously    0.24
beat          0.23
replace       0.22
Replace       0.21
Previously    0.20
Replace       0.20
jo            0.19
Activations Density: 0.071%