INDEX
Explanations
mentions of or related to girls
references to girls in various contexts
New Auto-Interp
Negative Logits
Component
-0.75
Gutenberg
-0.66
uron
-0.64
Mark
-0.64
urden
-0.63
License
-0.63
Render
-0.61
inct
-0.61
ulkan
-0.60
ector
-0.60
POSITIVE LOGITS
girls
3.63
girls
2.81
Girls
2.73
Girls
2.64
boys
2.55
girl
2.47
females
2.17
daughters
2.08
women
2.02
ladies
1.99
Activations Density 0.020%