INDEX
Explanations
statements or comments made by notable individuals
phrases related to advocacy and statements made by individuals in positions of authority
New Auto-Interp
Negative Logits
xa
-0.70
psy
-0.70
stabilization
-0.62
totality
-0.59
physical
-0.59
)}
-0.59
brill
-0.58
scaven
-0.58
beginners
-0.58
Subject
-0.58
POSITIVE LOGITS
Jr
0.95
Sr
0.77
uty
0.76
QC
0.76
icer
0.75
ogun
0.74
thouse
0.71
herty
0.70
presiding
0.69
«ĺ
0.69
Activations Density 0.754%