INDEX
Explanations
phrases about self-improvement and personal growth
Neuron Alignment
Index   Value   % of L₁
674     +0.15   0.5%
1533    +0.11   0.3%
1352    +0.10   0.3%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1919    +0.15      0.05
275     +0.11      0.03
1415    +0.10      0.02
Negative Logits
Erreferentziak     -0.53
存于互联网档案馆   -0.53
fign               -0.53
ftre               -0.49
droj               -0.48
Qw                 -0.48
Einzelnachweise    -0.47
<^                 -0.47
«<                 -0.47
Yg                 -0.46
Positive Logits
eccell        0.67
affez         0.62
piacevole     0.58
              0.58
ecru          0.58
ⓧ             0.55
tutt          0.55
/**           0.53
purtroppo     0.53
eccellente    0.53
Activations Density 0.411%