INDEX
Explanations
phrases related to political or regulatory actions
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1842
+0.13
0.4%
394
+0.11
0.3%
468
+0.11
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
946
+0.13
0.06
1553
+0.11
0.07
100
+0.11
0.05
Negative Logits
<bos>
-0.78
marginVertical
-0.58
EventQueue
-0.57
incarcer
-0.56
donc
-0.56
marginHorizontal
-0.56
setOpaque
-0.56
جغرافيا
-0.54
avoit
-0.51
BeginContext
-0.51
POSITIVE LOGITS
religione
0.68
liev
0.67
any
0.62
scienza
0.62
ciga
0.61
borsa
0.59
medes
0.59
levis
0.58
ananas
0.58
kokos
0.56
Activations Density 0.605%