INDEX
Explanations
mentions of colleagues or coworkers
the term "colleagues" in various contexts
New Auto-Interp
Negative Logits
aval
-0.79
Ukrain
-0.70
ball
-0.68
si
-0.65
uder
-0.65
iculture
-0.64
amac
-0.64
istic
-0.63
forest
-0.63
direction
-0.62
POSITIVE LOGITS
colleagues
1.11
colleague
1.03
coworkers
0.86
ority
0.74
ratulations
0.71
classmates
0.71
utenant
0.70
roomm
0.68
cowork
0.67
tasked
0.67
Activations Density 0.021%