INDEX
Explanations
phrases related to political announcements and events
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
856
+0.09
0.3%
942
+0.09
0.3%
227
+0.08
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
284
+0.09
0.06
1336
+0.09
0.06
1700
+0.08
0.04
Negative Logits
effe
-1.37
!...
-1.33
aen
-1.30
tranf
-1.27
desir
-1.24
ftu
-1.22
fta
-1.17
sii
-1.17
?...
-1.17
emphat
-1.17
POSITIVE LOGITS
tomorrow
1.40
next
1.09
upcoming
1.05
soon
1.03
tonight
0.94
tomorrow
0.93
Tomorrow
0.90
forthcoming
0.84
next
0.83
Tomorrow
0.81
Activations Density 0.506%