INDEX
Explanations
positive attributes and energetic descriptions related to individuals and their actions
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
235
+0.21
1.2%
121
+0.12
0.7%
143
+0.12
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
235
+0.21
0.09
412
+0.12
0.13
23
+0.12
0.16
Negative Logits
ĨĴ
-1.37
heid
-1.32
ĭ
-1.32
iscus
-1.30
³
-1.28
pshire
-1.28
isco
-1.26
icted
-1.25
ucc
-1.25
Thames
-1.23
POSITIVE LOGITS
emphasis
1.47
fear
1.44
plural
1.35
pursuit
1.34
part
1.28
shy
1.27
behaviors
1.27
↵ ↵
1.27
achievement
1.25
ale
1.25
Activations Density 4.974%