INDEX
Explanations
descriptions of types of personnel, likely in a professional context
Neuron Alignment
Index   Value   % of L₁
513     +0.09   0.3%
908     +0.08   0.2%
2034    +0.08   0.2%
Correlated Neurons
Index   P. Corr.   Cos Sim.
908     +0.09      0.03
513     +0.08      0.04
1263    +0.08      0.05
Negative Logits
ftu       -1.30
fep       -1.29
effe      -1.27
increa    -1.26
mef       -1.25
aen       -1.25
volunte   -1.25
affez     -1.23
fta       -1.23
fup       -1.20
Positive Logits
who       0.92
whom      0.81
who       0.75
whose     0.72
from      0.70
الذين     0.66
Who       0.64
willing   0.64
hips      0.63
Who       0.62
Activations Density: 0.517%