INDEX
Explanations
phrases related to progressive actions or initiatives
Neuron Alignment
Index   Value   % of L₁
251     +0.12   0.5%
68      +0.12   0.5%
892     +0.12   0.5%
Correlated Neurons
Index   P. Corr.   Cos Sim.
251     +0.12      0.03
1512    +0.12      0.02
1372    +0.12      0.02
Negative Logits
Token     Logit
setNome   -0.47
Brainz    -0.45
Mur       -0.45
how       -0.44
<em>      -0.43
<i>       -0.42
babwe     -0.42
Пар       -0.41
tiktok    -0.41
Exemplo   -0.41
Positive Logits
Token      Logit
tille      1.09
ADVANCED   0.99
Advance    0.98
ADVANCE    0.93
meis       0.90
Advance    0.90
jorge      0.88
leonardo   0.87
herre      0.87
advance    0.87
Activations Density: 0.074%