INDEX
Explanations
references to societal issues and racial dynamics, particularly surrounding ownership and identity
New Auto-Interp
Negative Logits
LabelTagHelper
-0.48
かに
-0.47
view
-0.45
GEBURTS
-0.45
몇
-0.45
my
-0.42
alábbi
-0.42
aras
-0.42
encuentre
-0.41
üzere
-0.41
POSITIVE LOGITS
cherchés
0.75
Бахар
0.75
GOTREF
0.74
ValueStyle
0.73
Autoritní
0.73
lenker
0.72
kasarigan
0.72
expandindo
0.72
Administrativna
0.72
autorytatywna
0.71
Activations Density 0.398%