INDEX
Explanations
references to geographical locations with specific population or statistical data
references to demographic groups and their social behaviors
Negative Logits
delay       -0.83
timeout     -0.78
Delay       -0.73
Failure     -0.73
éĹĺ         -0.71
Cancel      -0.70
pressure    -0.70
delaying    -0.70
Leaks       -0.70
delay       -0.68
Positive Logits
similarities    1.09
ancestry        1.00
origins         0.90
resemb          0.90
stereotypes     0.88
ethnicity       0.84
cultures        0.84
nationality     0.83
synonymous      0.83
anthropology    0.82
Activations Density: 1.378%