INDEX
Explanations
references to virulence in various contexts
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
450
+0.13
0.7%
410
+0.12
0.7%
303
+0.11
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
450
+0.13
0.01
111
+0.12
0.01
291
+0.11
0.01
Negative Logits
different
-1.64
our
-1.48
same
-1.40
much
-1.37
son
-1.33
growth
-1.32
scribe
-1.31
rapid
-1.31
cer
-1.29
pure
-1.28
POSITIVE LOGITS
ariate
2.20
ulent
2.10
ĻĤ
2.04
ģ
1.98
gins
1.87
idian
1.85
ulus
1.82
ulence
1.75
imes
1.68
iously
1.67
Activations Density 0.010%