INDEX
Explanations
phrases related to job roles and workplace scenarios
Neuron Alignment
Index   Value   % of L₁
674     +0.16   0.5%
1343    +0.15   0.4%
453     +0.11   0.3%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1801    +0.16      0.04
286     +0.15      0.03
779     +0.11      0.03
Negative Logits
Inhabitants    -0.94
impelled       -0.91
endeavouring   -0.91
Shakspeare     -0.91
gaily          -0.90
unlaw          -0.83
vainly         -0.83
Pamph          -0.82
McLaugh        -0.82
apprehen       -0.82
Positive Logits
solidar   1.10
anse      0.94
RSSSF     0.89
utop      0.86
marte     0.86
glan      0.84
notor     0.84
<bos>     0.83
lomb      0.83
intit     0.83
Activations Density: 0.275%