INDEX
Explanations
references to teenage behaviors and societal issues
New Auto-Interp
Negative Logits
Retired
-0.55
suaminya
-0.54
AxisAlignment
-0.52
Viitteet
-0.52
Retired
-0.49
retired
-0.49
zewnętrzne
-0.49
Familienname
-0.47
retired
-0.47
imetric
-0.46
POSITIVE LOGITS
teens
1.03
adolescence
1.01
student
0.97
teenage
0.97
adolescent
0.96
teenagers
0.96
adolescents
0.96
teenager
0.91
youth
0.91
teen
0.90
Activations Density 0.558%