INDEX
Explanations
references to the word "Dawn."
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1492
+0.14
0.7%
596
+0.12
0.6%
406
+0.12
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
406
+0.14
0.02
1590
+0.12
0.02
1512
+0.12
0.02
Negative Logits
<bos>
-1.52
Datuak
-0.74
expandindo
-0.69
Interes
-0.63
AutoScale
-0.62
propOrder
-0.62
XtraEditors
-0.60
Acab
-0.59
RepeatedField
-0.59
IActionResult
-0.59
POSITIVE LOGITS
Dawn
1.28
Dawn
1.20
emphat
1.17
madonna
1.15
dawn
1.12
milf
1.11
casio
1.11
😭😭
1.09
:)))
1.08
hentai
1.08
Activations Density 0.152%