INDEX
Explanations
text related to forums and online discussions
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
478
+0.19
1.1%
50
+0.15
0.8%
1741
+0.12
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
478
+0.19
0.18
1224
+0.15
0.09
2019
+0.12
0.14
Negative Logits
<bos>
-2.39
ⓧ
-1.15
/***
-1.11
<?
-1.08
-1.02
springfox
-0.98
<?
-0.94
/*!
-0.89
/**
-0.86
#![
-0.79
POSITIVE LOGITS
véhic
0.77
soulign
0.71
TokenType
0.69
'
0.68
uxx
0.65
Pamph
0.65
dison
0.64
arture
0.64
unspeak
0.64
gnition
0.63
Activations Density 1.294%