INDEX
Explanations
references to media coverage or commentary
Neuron Alignment
Index    Value    % of L₁
198      +0.09    0.3%
1535     +0.07    0.2%
109      +0.07    0.2%
Correlated Neurons
Index    P. Corr.    Cos Sim.
613      +0.09       0.05
109      +0.07       0.05
136      +0.07       0.03
Negative Logits
swarovski    -1.11
matel        -1.02
broderie     -1.01
Chapitre     -1.01
milano       -1.00
Février      -0.97
vété         -0.97
tricot       -0.97
Ename        -0.95
Pièces       -0.94
Positive Logits
news          0.71
coverage      0.66
newspapers    0.63
reporting     0.63
reports       0.62
reporters     0.61
headlines     0.61
newspaper     0.59
news          0.58
media         0.58
Activations Density 0.445%