Explanations
alerts and warnings in text
Neuron Alignment
Index   Value   % of L₁
61      +0.15   0.6%
313     +0.15   0.5%
596     +0.13   0.5%
Correlated Neurons
Index   P. Corr.   Cos Sim.
61      +0.15      0.02
313     +0.15      0.02
1363    +0.13      0.02
Negative Logits
tomans          -0.50
iocene          -0.47
LoggerFactory   -0.46
fusc            -0.46
inoco           -0.45
גרפיה           -0.45
يميديا          -0.45
thier           -0.43
UUM             -0.42
rrh             -0.42
Positive Logits
alert       1.30
alerting    1.19
alerts      1.19
Alerts      1.15
Alert       1.13
alert       1.08
alerted     1.07
Alert       0.99
alerts      0.99
alertness   0.97
Activations Density 0.070%