INDEX
Explanations
terms associated with systemic oppression and resistance
New Auto-Interp
Negative Logits
desmotivaciones
-0.57
posesión
-0.54
Problemas
-0.54
Infór
-0.51
PROBLE
-0.49
Drogen
-0.49
Probleme
-0.49
Regeln
-0.49
MLLoader
-0.48
Glied
-0.48
POSITIVE LOGITS
supers
0.54
Supers
0.52
fantastic
0.50
geweldige
0.46
Supers
0.44
fabulous
0.43
SLP
0.43
vPvB
0.42
fantastic
0.40
supers
0.40
Activations Density 0.837%