INDEX
Explanations
mentions of degrees, especially bachelor's degrees
references to academic degrees, particularly bachelor's degrees
New Auto-Interp
Negative Logits
andr
-0.86
anwhile
-0.82
DEC
-0.75
arily
-0.73
ppelin
-0.73
Canaver
-0.72
*/(
-0.72
NPR
-0.70
ombie
-0.66
gm
-0.66
POSITIVE LOGITS
bachelor
1.07
achelor
1.02
hood
0.86
uates
0.84
WithNo
0.82
degrees
0.80
achel
0.77
Boot
0.74
cffffcc
0.72
Bachelor
0.72
Activations Density 0.013%