Explanations
references to sports teams and their performance in competitions
Negative Logits
å¦ĩ         -0.08
igure       -0.07
PIP         -0.07
婦          -0.07
女人        -0.07
okus        -0.07
å©          -0.07
arters      -0.07
pregnant    -0.06
imb         -0.06
Positive Logits
underage       0.08
girls          0.08
bunk           0.08
Youth          0.07
boys           0.07
youth          0.07
youthful       0.07
Mädchen        0.07
developmental  0.06
girls          0.06
Activations Density: 0.013%