INDEX
Explanations
references to power dynamics and social justice issues
New Auto-Interp
Negative Logits
nakalista
-0.73
apunov
-0.68
betweenstory
-0.68
IntoConstraints
-0.67
חיצוניים
-0.64
ANDUM
-0.60
GEBURTSDATUM
-0.60
躇
-0.58
Мексичка
-0.57
Manbalar
-0.57
POSITIVE LOGITS
hypo
0.54
ciless
0.52
responsabile
0.49
hypocritical
0.49
jScrollPane
0.49
продъл
0.48
ystem
0.47
persistent
0.47
hipo
0.46
terus
0.46
Activations Density 0.684%