INDEX
Explanations
elements related to societal expectations and norms, particularly regarding gender roles and marriage
New Auto-Interp
Negative Logits
burrito
-0.48
trucker
-0.48
funky
-0.46
hamburger
-0.46
pizzeria
-0.45
burger
-0.44
tacos
-0.44
Pizzeria
-0.43
vété
-0.43
sweats
-0.42
POSITIVE LOGITS
aristocratic
0.75
gover
0.74
Lady
0.73
servants
0.72
Victorian
0.71
Regency
0.69
Viscount
0.68
Lady
0.66
arist
0.66
Victorian
0.64
Activations Density 0.674%