INDEX
    Explanations

    references to specific cultural identities or ethnic groups

    Negative Logits
     nacionales
    -0.54
     ambientales
    -0.54
     fiscales
    -0.54
     conclusiones
    -0.50
     prawa
    -0.50
     nacionais
    -0.50
    сай
    -0.48
     físicos
    -0.48
    äf
    -0.47
     adelantado
    -0.46
    Positive Logits
    RenderAtEndOf
    0.72
     виправивши
    0.71
     समीक्षाओं
    0.70
     صوتيه
    0.69
     समीक्षक
    0.68
    клопе
    0.66
    \{\\
    0.65
    ViewFeatures
    0.65
    "];
    0.64
    rungsseite
    0.64
    Act Density 0.093%

    No Known Activations