INDEX
Explanations
words related to work and workplace settings
references to employment or job-related activities
New Auto-Interp
Negative Logits
Ukrain
-0.91
Flavoring
-0.77
Russo
-0.69
EStream
-0.69
Bol
-0.69
constitu
-0.68
ĪĴ
-0.66
olic
-0.66
wcs
-0.65
pha
-0.64
POSITIVE LOGITS
ethic
1.18
aday
1.11
station
1.09
bench
1.04
manship
1.03
day
0.98
week
0.98
wear
0.92
hour
0.90
days
0.88
Activations Density 0.065%