INDEX
Explanations
references to various coaches across different sports
references to sports coaches
Negative Logits
yuan          -0.66
flaw          -0.65
corridor      -0.64
selves        -0.64
conn          -0.63
allowances    -0.62
poss          -0.62
recipients    -0.61
theless       -0.61
awatts        -0.61
Positive Logits
Bruce         0.91
Randy         0.91
Gregg         0.90
Mike          0.86
Dana          0.85
Steve         0.85
Todd          0.84
Dave          0.84
Pete          0.83
Luke          0.82
Activations Density 0.042%