Explanations
information about specific regions or countries, especially England and Wales
Neuron Alignment
Index    Value    % of L₁
172      +0.17    0.8%
122      +0.14    0.7%
101      +0.13    0.7%
Correlated Neurons
Index    P. Corr.    Cos Sim.
172      +0.17       0.04
1910     +0.14       0.03
101      +0.13       0.03
Negative Logits
Token          Value
<bos>          -1.03
qiao           -0.63
kwiet          -0.63
exé            -0.61
Mejora         -0.57
Simplemente    -0.55
Dữ             -0.55
ಮ್             -0.54
xiu            -0.54
="#"><         -0.54
Positive Logits
Token         Value
England       1.14
England       1.14
england       0.99
ENGLAND       0.93
england       0.89
Inglaterra    0.87
ENGL          0.73
Engl          0.72
Angleterre    0.72
uncin         0.72
Activations Density: 0.384%