INDEX
Explanations
unusual or significant occurrences referenced in a temporal context
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.34
1.2%
381
+0.12
0.4%
690
+0.11
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
381
+0.34
0.10
1959
+0.12
0.14
2044
+0.11
0.14
Negative Logits
<bos>
-2.66
{}
-0.63
interessa
-0.61
<!--
-0.57
};*/
-0.57
Organisateur
-0.56
contentLoaded
-0.56
///**
-0.56
//*/
-0.55
occorre
-0.55
POSITIVE LOGITS
Occasionally
1.09
oftentimes
1.08
Occasionally
1.05
sometimes
1.04
Sometimes
1.00
Often
0.95
usually
0.94
Usually
0.94
occasionally
0.92
sometimes
0.92
Activations Density 2.700%