INDEX
Explanations
mentions of a specific brand or company name
Neuron Alignment
Index    Value    % of L₁
1565     +0.11    0.3%
544      +0.11    0.3%
369      +0.09    0.3%
Correlated Neurons
Index    P. Corr.    Cos Sim.
1120     +0.11       0.04
184      +0.11       0.02
690      +0.09       0.04
Negative Logits
Herrick    -0.74
McLaugh    -0.68
<bos>      -0.67
Unger      -0.66
McFar      -0.64
Kearns     -0.64
McInt      -0.64
Kruse      -0.57
Hickey     -0.57
inform     -0.57
Positive Logits
Ottobre      1.59
Baldwin      1.58
broder       1.54
Settembre    1.49
cannes       1.49
marseille    1.48
Traité       1.42
Luglio       1.41
tyn          1.41
chèvre       1.39
Activations Density: 0.299%