INDEX
Explanations
mentions of the country "Russia."
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1984
+0.12
0.4%
1053
+0.10
0.4%
517
+0.10
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
517
+0.12
0.03
1984
+0.10
0.04
795
+0.10
0.03
Negative Logits
kaynağından
-0.55
<bos>
-0.52
ContentAlignment
-0.52
fieldLabel
-0.49
item
-0.49
Paraguay
-0.48
republi
-0.48
Demok
-0.48
Ende
-0.48
insuffisamment
-0.48
POSITIVE LOGITS
Russia
1.13
Rubén
1.07
Áng
1.06
Russian
1.05
russian
1.03
Mónica
1.03
Russia
1.02
russia
1.01
Mejía
1.00
Russians
1.00
Activations Density 0.073%