INDEX
Explanations
references to athletes and sports players
Negative Logits
ailability   -0.16
ainer        -0.14
rieb         -0.14
Edgar        -0.14
ahoma        -0.14
825          -0.14
poke         -0.14
Laden        -0.14
ikan         -0.14
porter       -0.14
Positive Logits
OLEAN        0.15
bud          0.15
arent        0.15
lov          0.14
aptic        0.14
Kraj         0.14
cratch       0.14
Lion         0.13
ahl          0.13
Blonde       0.13
Activations Density: 0.011%