INDEX
Explanations
statements related to announcements or unveiling of information
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
674
+0.09
0.3%
856
+0.09
0.2%
690
+0.09
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
335
+0.09
0.04
330
+0.09
0.03
836
+0.09
0.02
Negative Logits
aquarelle
-1.17
lele
-1.06
!...
-1.02
aen
-1.02
NOO
-1.01
levis
-1.00
casio
-0.99
centrif
-0.98
Juf
-0.98
?...
-0.98
POSITIVE LOGITS
hope
0.81
await
0.78
predict
0.74
wait
0.72
hopefully
0.70
waiting
0.70
predictions
0.70
awaits
0.70
hoping
0.69
anticipate
0.69
Activations Density 0.315%