Explanations
instances of the word "work" used in various contexts
Neuron Alignment
Index   Value   % of L₁
144     +0.13   0.4%
1993    +0.13   0.4%
241     +0.12   0.4%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1993    +0.13      0.06
144     +0.13      0.05
169     +0.12      0.04
Negative Logits
Token       Logit
radikal     -0.54
zyn         -0.53
sonder      -0.52
minimalis   -0.52
krab        -0.51
akade       -0.49
ideolog     -0.47
Reiz        -0.47
rü          -0.46
kosme       -0.46
Positive Logits
Token       Logit
Work        1.05
Work        1.04
work        1.02
work        1.01
WORK        1.01
WORK        0.93
Worked      0.86
pixar       0.85
workday     0.84
hairc       0.81
Activations Density: 0.123%