INDEX
    Explanations

    terms related to personal characteristics or attributes

    New Auto-Interp
    Negative Logits
     personalities
    -0.20
     personality
    -0.17
     Personnel
    -0.16
    eenth
    -0.15
     Personality
    -0.15
    arian
    -0.15
    een
    -0.15
    лиÑĤ
    -0.14
    lero
    -0.14
    andal
    -0.14
    POSITIVE LOGITS
    izable
    0.29
    ization
    0.27
    izing
    0.27
    ised
    0.26
    ty
    0.26
    ities
    0.25
    ise
    0.24
    ized
    0.24
    izes
    0.24
    isation
    0.23
    Act Density 0.037%

    No Known Activations