Explanations
instances and discussions related to women
Negative Logits
Token       Logit
purpoſe     -0.81
Cæsar       -0.79
Theſe       -0.78
pleaſure    -0.77
himſelf     -0.76
iſt         -0.75
Monfieur    -0.74
Majefty     -0.74
itſelf      -0.71
myſelf      -0.71
Positive Logits
Token       Logit
woman       3.03
women       2.76
Woman       2.71
Woman       2.61
woman       2.57
Women       2.52
women       2.46
Women       2.43
WOMAN       2.36
WOMEN       2.23
Activations Density 0.117%
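
The density figure can be read as the share of sampled tokens on which the feature fires; at 0.117% that is about 117 tokens per 100,000. The sketch below illustrates that reading only. It assumes density means the percentage of activations strictly above a threshold of zero, which the page does not state; the function name `activation_density` and the sample values are hypothetical and not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of activations strictly above `threshold`.

    Assumption: "density" means the share of sampled tokens on which
    the feature fires. The source page does not define the term.
    """
    values = list(activations)
    if not values:
        raise ValueError("need at least one activation")
    fired = sum(1 for a in values if a > threshold)
    return 100.0 * fired / len(values)


# Hypothetical sample: the feature fires on 3 of 8 tokens.
sample = [0.0, 0.0, 1.4, 0.0, 0.0, 0.2, 0.0, 3.1]
print(f"{activation_density(sample):.3f}%")  # prints "37.500%"
```

A strict `>` comparison is used so that exact zeros, the usual output for an inactive feature, are never counted as firing.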