INDEX
Explanations
proper names of individuals
Neuron Alignment
Index   Value   % of L₁
545     +0.12   0.7%
421     +0.12   0.7%
120     +0.12   0.7%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1097    +0.12      0.04
1044    +0.12      0.03
144     +0.12      0.03
Negative Logits
<bos>         -1.78
Enllaces      -0.60
/**           -0.60
ⓧ             -0.57
Cyfeiriadau   -0.57
mobil         -0.56
rid           -0.56
Життєпис      -0.55
Cecil         -0.54
<?            -0.53
Positive Logits
dave        1.61
Dave        1.51
Dave        1.44
dave        1.19
bandeau     1.11
beaute      1.03
swarovski   1.02
blackpink   1.00
pettico     0.98
vété        0.97
Activations Density: 0.395%