INDEX
    Explanations

    references to the term "man" in various contexts

    New Auto-Interp
    Negative Logits
    iesen
    -0.21
    gen
    -0.18
    go
    -0.16
    ë´
    -0.16
    ammers
    -0.15
    born
    -0.15
    ted
    -0.15
    grade
    -0.15
    й
    -0.15
    kart
    -0.15
    POSITIVE LOGITS
    agements
    0.23
    hattan
    0.23
    iac
    0.22
    ifold
    0.19
    atee
    0.18
    agment
    0.18
    agers
    0.18
    chester
    0.18
    UEL
    0.18
    e
    0.18
    Act Density 0.126%

    No Known Activations