INDEX
Explanations
mentions of different colors and faiths, particularly in a social context
New Auto-Interp
Negative Logits
Quantity
-0.71
alloc
-0.69
EVA
-0.67
sensations
-0.66
washer
-0.65
Examples
-0.64
Deal
-0.63
Prosecut
-0.62
ebin
-0.61
Features
-0.61
POSITIVE LOGITS
whom
1.27
attendance
0.76
pires
0.75
who
0.71
whose
0.70
hran
0.69
backgrounds
0.69
pired
0.69
alike
0.68
professions
0.68
Activations Density 1.436%