INDEX
Explanations
references to fans of various sports teams and games
New Auto-Interp
Negative Logits
divisions
-0.68
ãģĤ
-0.62
embassies
-0.61
penalties
-0.61
kingdoms
-0.60
men
-0.60
runners
-0.60
MEN
-0.60
balls
-0.60
dancers
-0.59
POSITIVE LOGITS
extraord
0.96
digy
0.91
myself
0.81
rette
0.77
turned
0.76
abilia
0.76
rejoice
0.75
chuk
0.73
nonetheless
0.71
blogger
0.70
Activations Density 0.065%