INDEX
    Explanations

    references to power dynamics and social inequalities

    New Auto-Interp
    Negative Logits
     viewDidLoad
    -0.59
    openConnection
    -0.56
    kkelen
    -0.54
    nologue
    -0.54
     useParams
    -0.54
     useSelector
    -0.53
     Ανακτήθηκε
    -0.52
    isome
    -0.52
    Pops
    -0.52
    quedas
    -0.52
    POSITIVE LOGITS
     own
    0.82
     Own
    0.72
    Own
    0.71
    自分も
    0.69
     eigener
    0.66
     zelf
    0.60
     eigene
    0.59
     selbst
    0.59
     selber
    0.58
    own
    0.57
    Act Density 0.285%

    No Known Activations