INDEX
Explanations
words related to ambition and determination
Neuron Alignment
Index   Value   % of L₁
674     +0.27   0.8%
1967    +0.15   0.5%
198     +0.13   0.4%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1397    +0.27      0.02
1166    +0.15      0.07
198     +0.13      0.05
Negative Logits
Token          Logit
Considerable   -0.81
Illus          -0.74
Byp            -0.73
Dangers        -0.72
Pursu          -0.68
Appears        -0.67
Seem           -0.66
Eqn            -0.66
Renewed        -0.66
Sugges         -0.65
Positive Logits
Token          Logit
<bos>          1.55
affez          1.25
parteci        1.18
soggior        1.07
auguri         1.06
sappi          1.05
abbra          1.04
interessanti   1.03
paillettes     1.03
ridu           1.02
Activations Density: 0.661%