INDEX
Explanations
words related to relationships with coworkers
references to workers and their experiences
Negative Logits
format        -0.72
overt         -0.70
offence       -0.66
preseason     -0.63
multiplayer   -0.61
genre         -0.60
zoom          -0.60
dis           -0.59
obfusc        -0.59
amphib        -0.59
Positive Logits
workers       4.25
worker        3.61
working       1.79
Workers       1.73
Worker        1.56
work          1.47
workers       1.42
Work          1.41
worker        1.37
WORK          1.31
Activations Density 0.011%