INDEX
    Explanations

    words indicating age and gender

    New Auto-Interp
    Negative Logits
    dv
    -0.59
    出版年
    -0.59
    съ
    -0.58
    въ
    -0.57
    chaus
    -0.56
    Väl
    -0.56
     Tran
    -0.55
    -0.55
    usehen
    -0.54
     vš
    -0.54
    POSITIVE LOGITS
     Efq
    0.99
     Jefus
    0.84
     Monfieur
    0.80
    toMatchSnapshot
    0.79
     myſelf
    0.78
     holotype
    0.78
    PMailer
    0.77
     whoſe
    0.75
     betweenstory
    0.74
    mergeFrom
    0.73
    Act Density 0.100%

    No Known Activations