INDEX
Explanations
mentions of a specific publication, The Guardian
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
795
+0.15
0.6%
1618
+0.15
0.6%
1741
+0.14
0.5%
Correlated Neurons
Index
P. Corr.
Cos Sim.
795
+0.15
0.02
1493
+0.15
0.01
214
+0.14
0.01
Negative Logits
ôtel
-0.57
tagHelperRunner
-0.48
requipa
-0.48
äsident
-0.47
flé
-0.46
CardBody
-0.45
esternos
-0.45
larged
-0.44
losseum
-0.44
PILE
-0.44
POSITIVE LOGITS
Guardian
1.29
Guardian
1.24
guardian
1.08
Guardians
1.08
guardian
1.07
theguardian
1.00
guardians
0.93
Guardians
0.90
GUARD
0.85
GUARD
0.74
Activations Density 0.105%