INDEX
Explanations
first-person experiences and actions related to personal growth and reflection
Neuron Alignment
Index   Value   % of L₁
50      +0.38   1.3%
453     +0.11   0.4%
381     +0.11   0.4%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1919    +0.38      0.14
862     +0.11      0.09
1415    +0.11      0.10
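The two columns above report different similarity measures. The minimal sketch below shows how Pearson correlation and cosine similarity are conventionally defined for a pair of vectors. Which vectors the dashboard actually compares (for example activations over tokens, or weight directions) is not stated in the source, so the function names and the toy inputs `a` and `b` are purely hypothetical.

```python
import math


def cosine_sim(x, y):
    """Cosine of the angle between x and y (no mean-centering)."""
    dot = sum(a * b for a, b in zip(x, y))
    norm_x = math.sqrt(sum(a * a for a in x))
    norm_y = math.sqrt(sum(b * b for b in y))
    return dot / (norm_x * norm_y)


def pearson_corr(x, y):
    """Pearson correlation: cosine similarity of the mean-centered vectors."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    x_centered = [v - mean_x for v in x]
    y_centered = [v - mean_y for v in y]
    return cosine_sim(x_centered, y_centered)


# Hypothetical toy vectors (not taken from the dashboard): b = 2*a + 1.
a = [0.0, 1.0, 0.0, 2.0]
b = [1.0, 3.0, 1.0, 5.0]

p = pearson_corr(a, b)  # insensitive to the constant offset in b
c = cosine_sim(a, b)    # sensitive to the constant offset in b
```

Because Pearson correlation subtracts each vector's mean and cosine similarity does not, the two values can differ noticeably for the same pair, as they do in the table.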
Negative Logits
<bos>        -1.21
intersper    -0.89
oleo         -0.85
répon        -0.85
soigne       -0.82
cushi        -0.81
exé          -0.79
overcrow     -0.78
tupperware   -0.77
quitted      -0.74
Positive Logits
Banten         0.70
Muhamma        0.69
smtplib        0.69
Estou          0.69
alnız          0.69
dónde          0.68
Lampung        0.67
Quiénes        0.66
Ótimo          0.66
GEBURTSDATUM   0.65
Activations Density: 1.121%