INDEX
Explanations
references to power dynamics and social inequalities
New Auto-Interp
Negative Logits
viewDidLoad
-0.59
openConnection
-0.56
kkelen
-0.54
nologue
-0.54
useParams
-0.54
useSelector
-0.53
Ανακτήθηκε
-0.52
isome
-0.52
Pops
-0.52
quedas
-0.52
POSITIVE LOGITS
own
0.82
Own
0.72
Own
0.71
自分も
0.69
eigener
0.66
zelf
0.60
eigene
0.59
selbst
0.59
selber
0.58
own
0.57
Activations Density 0.285%