INDEX
    Explanations

    words and phrases associated with official positions, roles, or titles

    New Auto-Interp
    Negative Logits
    ванов
    -0.16
    enheim
    -0.15
    eco
    -0.15
    VICE
    -0.14
    Į
    -0.14
     závod
    -0.14
     McG
    -0.14
     analogy
    -0.14
    à¹Ģà¸Ńà¸ĩ
    -0.14
    esity
    -0.13
    POSITIVE LOGITS
     Person
    0.19
     Rel
    0.17
     person
    0.17
     rel
    0.17
    kategori
    0.16
    ilyn
    0.16
     kategor
    0.15
    avl
    0.15
    Person
    0.15
    avn
    0.15
    Act Density 0.013%

    No Known Activations