INDEX
Explanations
phrases related to workplace issues and actions
references to work and employment
New Auto-Interp
Negative Logits
wcs
-0.72
Bour
-0.72
Ukrain
-0.70
constitu
-0.70
EStream
-0.66
Flavoring
-0.65
anamo
-0.63
Russo
-0.60
Vi
-0.60
Yen
-0.60
POSITIVE LOGITS
bench
1.34
station
1.18
ethic
1.14
fare
1.12
aday
1.11
manship
1.08
horse
1.07
hops
1.04
flows
1.00
work
0.98
Activations Density 0.075%