Explanations
references to coaching and sports-related promotions
Negative Logits
ActionCreators    -0.18
_mgmt             -0.16
adele             -0.15
apore             -0.15
Creators          -0.15
pNet              -0.14
ÑģÑĤвоÑĢ          -0.14
IJ                -0.14
çĽĸ               -0.14
BindingUtil       -0.14
Positive Logits
assistant         0.30
Assistant         0.27
assistants        0.23
Assistant         0.22
coach             0.21
Assist            0.20
528               0.19
assistant         0.19
coaching          0.19
coaches           0.19
Activations Density: 0.041%