INDEX
    Explanations

    references to societal issues and racial dynamics, particularly surrounding ownership and identity

    Negative Logits
    Token               Logit
    "LabelTagHelper"    -0.48
    "かに"              -0.47
    " view"             -0.45
    "GEBURTS"           -0.45
                        -0.45
    " my"               -0.42
    " alábbi"           -0.42
    " aras"             -0.42
    " encuentre"        -0.41
    " üzere"            -0.41
    Positive Logits
    Token                 Logit
    " cherchés"           0.75
    "Бахар"               0.75
    "GOTREF"              0.74
    "ValueStyle"          0.73
    "Autoritní"           0.73
    " lenker"             0.72
    " kasarigan"          0.72
    "expandindo"          0.72
    " Administrativna"    0.72
    " autorytatywna"      0.71
    Activation Density: 0.398%

    No Known Activations