INDEX
Explanations
proper names of individuals
references to people's roles, titles, or positions of authority
New Auto-Interp
Negative Logits
trunc
-0.67
SPONSORED
-0.65
Split
-0.63
inosaur
-0.63
Contents
-0.62
dstg
-0.62
netflix
-0.62
calendar
-0.62
EXP
-0.62
Tube
-0.61
POSITIVE LOGITS
spokesman
0.92
PhD
0.92
spokesperson
0.91
Jr
0.90
researcher
0.86
spokeswoman
0.86
lecturer
0.85
Managing
0.85
dean
0.82
professor
0.82
Activations Density 0.157%