INDEX
Explanations
references to the conflict in Syria in different contexts
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1416
+0.19
0.7%
1491
+0.14
0.5%
555
+0.13
0.5%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1416
+0.19
0.06
1491
+0.14
0.04
1464
+0.13
0.04
Negative Logits
<bos>
-0.85
PerformLayout
-0.58
WriteBarrier
-0.58
conlle
-0.50
NKC
-0.49
AUF
-0.48
Тру
-0.48
Hå
-0.46
ceptives
-0.46
lielmo
-0.46
POSITIVE LOGITS
Syria
1.09
Syrian
0.99
Syria
0.95
Syrians
0.92
Sycamore
0.85
Siria
0.76
Darío
0.76
Sy
0.75
Sy
0.75
Sykes
0.69
Activations Density 0.066%