INDEX
Explanations
phrases related to personal experiences and emotions
Neuron Alignment
Index   Value   % of L₁
1445    +0.11   0.3%
1381    +0.10   0.3%
2034    +0.09   0.3%
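
The "% of L₁" column is not defined on this page. One plausible reading, stated here as an assumption, is that it gives a neuron's absolute value as a share of the L₁ norm (the sum of absolute values) of the whole vector. A minimal sketch of that reading follows; the function name `percent_of_l1` and the toy vector are hypothetical and not taken from the dashboard.

```python
def percent_of_l1(weights, index):
    """Return |weights[index]| as a percentage of the vector's L1 norm.

    Assumption: "% of L1" means one entry's share of the sum of
    absolute values. The dashboard does not confirm this definition.
    """
    l1_norm = sum(abs(w) for w in weights)
    if l1_norm == 0:
        raise ValueError("L1 norm is zero; the share is undefined")
    return 100.0 * abs(weights[index]) / l1_norm


# Hypothetical toy vector, chosen only to make the arithmetic easy to follow.
toy_weights = [0.5, -0.25, 0.25]
share = percent_of_l1(toy_weights, 0)  # 0.5 out of an L1 norm of 1.0
```

Under this reading, a value of +0.11 at 0.3% would imply an L₁ norm of very roughly 37 for the full vector, since the displayed figures are rounded.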
Correlated Neurons
Index   P. Corr.   Cos Sim.
1265    +0.11      0.05
1445    +0.10      0.06
1592    +0.09      0.03
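
The two columns above can differ because they measure different things. Assuming "P. Corr." abbreviates Pearson correlation and "Cos Sim." abbreviates cosine similarity (the page does not spell either out), the sketch below shows the relationship between them: Pearson correlation is cosine similarity computed after subtracting each vector's mean. The function names and sample vectors are hypothetical, and the page does not say which vectors each column compares.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def pearson_correlation(a, b):
    """Pearson correlation: cosine similarity of the mean-centered vectors."""
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    centered_a = [x - mean_a for x in a]
    centered_b = [y - mean_b for y in b]
    return cosine_similarity(centered_a, centered_b)


# Hypothetical vectors: perfectly anti-correlated, yet pointing in a
# broadly similar direction because every entry is positive.
u = [1.0, 2.0, 3.0]
v = [3.0, 2.0, 1.0]
corr = pearson_correlation(u, v)
cos = cosine_similarity(u, v)
```

The sample shows that the two measures need not agree in magnitude or even in sign, so a small cosine similarity next to a larger correlation is not contradictory.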
Negative Logits
Settembre   -1.06
Ottobre     -1.03
Février     -1.02
Giugno      -1.02
Luglio      -0.99
sappi       -0.98
fua         -0.92
quæ         -0.92
vorrei      -0.92
cæ          -0.91
Positive Logits
doing       0.97
trying      0.95
making      0.95
getting     0.90
giving      0.87
preparing   0.86
looking     0.85
creating    0.85
putting     0.85
providing   0.84
Activations Density 0.453%