INDEX
    Explanations

    mentions of different colors and faiths, particularly in a social context

    New Auto-Interp
    Negative Logits
    Quantity
    -0.71
    alloc
    -0.69
    EVA
    -0.67
     sensations
    -0.66
    washer
    -0.65
    Examples
    -0.64
    Deal
    -0.63
    Prosecut
    -0.62
    ebin
    -0.61
    Features
    -0.61
    POSITIVE LOGITS
     whom
    1.27
     attendance
    0.76
    pires
    0.75
    who
    0.71
    whose
    0.70
    hran
    0.69
     backgrounds
    0.69
    pired
    0.69
     alike
    0.68
     professions
    0.68
    Act Density 1.436%

    No Known Activations