INDEX
Explanations
people with academic or professional titles
titles and roles related to academic and professional positions
Negative Logits
Token        Logit
Blacks       -0.67
Hague        -0.67
Beasts       -0.63
attacks      -0.62
rums         -0.58
bathrooms    -0.58
Carnage      -0.57
veyard       -0.57
cloves       -0.57
ems          -0.57
Positive Logits
Token          Logit
member         0.92
of             0.92
specializing   0.90
aboard         0.85
at             0.83
Fellow         0.82
attending      0.78
advisor        0.77
adviser        0.75
studying       0.74
Activations Density 0.153%
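
The density figure can be read with a small sketch. It assumes, since the page does not define the term, that activation density means the fraction of sampled tokens on which the feature's activation is above zero. The function name `activation_density`, the `threshold` parameter, and the toy data are hypothetical and only illustrate the arithmetic.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of tokens whose activation exceeds `threshold`.

    Assumed definition: the source page does not say how density is computed.
    """
    if not activations:
        raise ValueError("need at least one activation")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: 3 active tokens out of 2000 sampled tokens.
toy = [0.0] * 1997 + [1.2, 0.4, 3.1]
density = activation_density(toy)
print(f"{density:.3%}")  # formatted as a percentage with three decimals
```

Under this reading, a density of 0.153% would correspond to roughly 1.5 active tokens per 1,000 sampled.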