INDEX
    Explanations

    content related to social justice and human rights issues

    New Auto-Interp
    Negative Logits
    hong
    -0.15
    ãĥ¼ãĥ«ãĥī
    -0.14
    inati
    -0.14
    .zh
    -0.13
     peasants
    -0.13
    agi
    -0.13
    ]={↵
    -0.13
    pret
    -0.13
     \"$
    -0.13
    ajas
    -0.13
    POSITIVE LOGITS
     Integration
    0.38
     integration
    0.36
    Integration
    0.35
    integration
    0.31
     immigrants
    0.31
    _integration
    0.30
     immigration
    0.30
     Migration
    0.29
     immigrant
    0.29
     migrants
    0.28
    Act Density 0.029%

    No Known Activations