INDEX
Explanations
references to girls and women
mentions of "Girls" within various contexts
New Auto-Interp
Negative Logits
convincing
-0.77
utherford
-0.75
SPONSORED
-0.73
loud
-0.71
Blumenthal
-0.71
ype
-0.69
Kaine
-0.67
shaking
-0.64
ital
-0.64
ypes
-0.63
POSITIVE LOGITS
Girls
1.45
Girls
1.35
Actress
0.91
Girl
0.90
Boys
0.90
Haram
0.87
Apps
0.87
poons
0.86
glers
0.85
Fighters
0.85
Activations Density 0.017%