Explanations
names of individuals and social media handles
Neuron Alignment
Index   Value   % of L₁
50      +0.22   0.8%
1343    +0.19   0.7%
227     +0.09   0.4%
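The "% of L₁" column is not defined on this page. One plausible reading, stated here as an assumption, is that it reports each neuron's absolute weight as a share of the L₁ norm of the feature's full weight vector. The sketch below illustrates that reading only; `l1_share` and the toy vector are hypothetical and are not taken from the dashboard.

```python
def l1_share(weights):
    """Return each weight's absolute value as a fraction of the vector's L1 norm.

    Assumption: "% of L1" means |w_i| / sum_j |w_j|. The dashboard does not
    confirm this definition.
    """
    l1_norm = sum(abs(w) for w in weights)
    if l1_norm == 0:
        raise ValueError("L1 norm is zero; shares are undefined")
    return [abs(w) / l1_norm for w in weights]


# Hypothetical toy vector whose L1 norm is 1.0, so each share equals |w_i|.
toy_weights = [0.22, -0.19, 0.09, 0.5]
shares = l1_share(toy_weights)
```

Under this reading, a value of +0.22 at 0.8% would imply an L₁ norm of roughly 27 for the full vector, which is consistent in order of magnitude with the other two rows.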
Correlated Neurons
Index   P. Corr.   Cos Sim.
1343    +0.22      0.05
1275    +0.19      0.04
1097    +0.09      0.04
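For readers comparing the two columns, the following is a minimal sketch of how a Pearson correlation ("P. Corr.") and a cosine similarity ("Cos Sim.") differ. Which vectors the dashboard actually compares is not stated here, so the inputs below are hypothetical toy data. The only difference between the two measures is that Pearson correlation centers each vector on its mean first.

```python
import math


def cosine_similarity(x, y):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(x, y))
    norm_x = math.sqrt(sum(a * a for a in x))
    norm_y = math.sqrt(sum(b * b for b in y))
    return dot / (norm_x * norm_y)


def pearson_correlation(x, y):
    """Pearson correlation: cosine similarity of the mean-centered vectors."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    return cosine_similarity(
        [a - mean_x for a in x],
        [b - mean_y for b in y],
    )


# Hypothetical toy vectors: perfectly anti-correlated, yet the cosine
# similarity stays positive because every entry is positive.
u = [1.0, 2.0, 3.0]
v = [3.0, 2.0, 1.0]
p_corr = pearson_correlation(u, v)
cos_sim = cosine_similarity(u, v)
```

This is why the two columns can disagree in magnitude for the same neuron: the measures are computed differently and, on a dashboard like this one, possibly over different quantities.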
Negative Logits
<bos>           -2.45
ⓧ               -1.01
Autoritní       -0.80
/**             -0.78
<?              -0.75
                -0.74
GEBURTSDATUM    -0.72
RegressionTest  -0.68
Panamoan        -0.62
intios          -0.61
Positive Logits
accla    1.05
saar     1.02
kristal  0.99
vne      0.98
kram     0.97
silikon  0.94
Minang   0.94
makro    0.94
Strukt   0.94
keramik  0.93
Activations Density 0.053%