INDEX
Explanations
mentions of academic titles and positions, especially those related to educational institutions
references to academic positions or titles
New Auto-Interp
Negative Logits
orsi
-0.73
ABE
-0.73
bley
-0.70
Labrador
-0.67
PsyNetMessage
-0.67
Freedom
-0.66
ighters
-0.65
pire
-0.63
Hobby
-0.62
rights
-0.60
POSITIVE LOGITS
dean
1.10
onym
0.96
ctor
0.82
ials
0.79
ially
0.77
clair
0.73
emer
0.73
ofi
0.73
alum
0.69
stown
0.69
Activations Density 0.007%