INDEX
Explanations
discussions about sensitive social and racial issues
New Auto-Interp
Negative Logits
клопе
-0.58
IntoConstraints
-0.55
__(/*!
-0.55
övers
-0.55
ächlich
-0.54
Rossa
-0.52
appetizer
-0.52
Katso
-0.51
وفة
-0.51
ERSHIP
-0.50
POSITIVE LOGITS
people
0.98
tragedies
0.86
politicians
0.83
sadly
0.82
infeliz
0.81
purtroppo
0.80
Sadly
0.79
incidents
0.79
Sadly
0.76
parents
0.75
Activations Density 0.410%