INDEX
Explanations
phrases related to past employment and job roles
phrases describing occupations or roles
New Auto-Interp
Negative Logits
oche
-0.75
yrs
-0.74
redits
-0.70
tons
-0.68
/$
-0.66
rous
-0.65
rogens
-0.62
ALS
-0.62
raq
-0.62
ru
-0.62
POSITIVE LOGITS
pires
1.19
pired
1.09
an
1.00
a
0.94
deputy
0.91
ambassador
0.90
curator
0.88
assistant
0.86
director
0.86
translator
0.86
Activations Density 0.089%