INDEX
Explanations
contact and address information
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1343
+0.18
0.5%
674
+0.15
0.4%
24
+0.07
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1343
+0.18
0.05
1527
+0.15
0.03
382
+0.07
0.03
Negative Logits
pamph
-1.04
inappro
-0.92
shenan
-0.91
unwarran
-0.87
Daven
-0.82
Illus
-0.79
impra
-0.79
racon
-0.78
unspeak
-0.78
Pamph
-0.77
POSITIVE LOGITS
pama
1.08
kamb
0.90
bambu
0.85
siena
0.84
saul
0.82
susun
0.78
complished
0.77
gela
0.77
kuku
0.76
karte
0.75
Activations Density 0.137%