INDEX
Explanations
phrases related to employment and workplace issues
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1573
+0.16
0.6%
241
+0.14
0.5%
597
+0.12
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1573
+0.16
0.03
241
+0.14
0.02
196
+0.12
0.02
Negative Logits
macrop
-0.62
impractica
-0.60
inext
-0.59
muco
-0.57
indor
-0.51
laminar
-0.51
reluct
-0.50
disgra
-0.49
concier
-0.49
unden
-0.48
POSITIVE LOGITS
employment
1.29
Employment
1.20
employment
1.07
employ
1.04
Employment
1.02
employed
1.00
employ
0.97
employs
0.91
employed
0.90
employing
0.88
Activations Density 0.052%