INDEX
Explanations
references to athletic activities and training
New Auto-Interp
Negative Logits
puck
-0.15
丸
-0.15
rode
-0.14
pitched
-0.14
Mane
-0.14
Motorcycle
-0.14
Riding
-0.14
comb
-0.13
cascade
-0.13
motorcycle
-0.13
POSITIVE LOGITS
running
0.39
runners
0.38
Running
0.38
running
0.36
Running
0.36
runner
0.35
Runner
0.34
RUNNING
0.33
race
0.33
-running
0.31
Activations Density 0.122%