INDEX
    Explanations

    phrases related to accountability and personal responsibility

    New Auto-Interp
    Negative Logits
    SSID
    -0.16
    ãĤ¤ãĥ¤
    -0.15
     Erotische
    -0.14
    pron
    -0.14
    ardım
    -0.14
    ozÃŃ
    -0.14
     erotik
    -0.14
     Kız
    -0.13
     erotique
    -0.13
    lfw
    -0.13
    POSITIVE LOGITS
     man
    0.25
     owning
    0.22
     ownership
    0.21
     owned
    0.20
     admission
    0.19
     Ownership
    0.19
     Man
    0.18
     ADM
    0.18
    Ownership
    0.18
    ownership
    0.18
    Act Density 0.062%

    No Known Activations