INDEX
    Explanations

    instances and discussions related to women

    Negative Logits
    Token          Logit
    " purpoſe"     -0.81
    " Cæsar"       -0.79
    " Theſe"       -0.78
    " pleaſure"    -0.77
    " himſelf"     -0.76
    " iſt"         -0.75
    " Monfieur"    -0.74
    " Majefty"     -0.74
    " itſelf"      -0.71
    " myſelf"      -0.71

    Positive Logits
    Token          Logit
    " woman"       3.03
    " women"       2.76
    " Woman"       2.71
    "Woman"        2.61
    "woman"        2.57
    " Women"       2.52
    "women"        2.46
    "Women"        2.43
    " WOMAN"       2.36
    " WOMEN"       2.23

    Activation Density: 0.117%

    No Known Activations