INDEX
Explanations
terms related to boys or male individuals
instances of the word "boys" and related variations indicating a group of male children or adolescents
New Auto-Interp
Negative Logits
kson
-0.73
olson
-0.69
atively
-0.67
uits
-0.65
palms
-0.64
Fallon
-0.61
canopy
-0.60
trial
-0.60
ukemia
-0.59
urses
-0.59
POSITIVE LOGITS
awei
0.90
rahim
0.87
inary
0.82
Bey
0.81
dinand
0.78
Wan
0.77
boy
0.71
Mag
0.70
doms
0.70
rieved
0.69
Activations Density 0.027%