INDEX
    Explanations

    mentions of European nationalities

    references to specific nationalities or ethnicities

    New Auto-Interp
    Negative Logits
    odder
    -0.86
    icago
    -0.85
    nyder
    -0.85
    utherford
    -0.84
    ertodd
    -0.83
    affles
    -0.81
    mble
    -0.81
    iscons
    -0.79
    ividual
    -0.79
    uder
    -0.78
    POSITIVE LOGITS
    oslov
    0.98
     Nadu
    0.89
     nationals
    0.88
     shepherd
    0.84
     translation
    0.83
     cuisine
    0.82
     accent
    0.81
     proverb
    0.81
     Portuguese
    0.79
    istani
    0.78
    Act Density 0.139%

    No Known Activations