INDEX
    Explanations

    mentions of ownership or the concept of being an owner in various contexts

    New Auto-Interp
    Negative Logits
    ute
    -0.20
    ula
    -0.19
    ulla
    -0.16
    ero
    -0.16
    ëĭ¤
    -0.15
    uten
    -0.15
     dozen
    -0.15
    ån
    -0.15
    oning
    -0.15
    iones
    -0.14
    POSITIVE LOGITS
    /operator
    0.31
    /operators
    0.30
    -operator
    0.29
    /man
    0.24
    -manager
    0.17
     trÃŃ
    0.17
    /manage
    0.17
    /admin
    0.16
    -fashioned
    0.16
    lier
    0.16
    Act Density 0.044%

    No Known Activations