INDEX
Explanations
political and geographical terms related to regions and international relations
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
906
+0.10
0.3%
1343
+0.10
0.3%
609
+0.09
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
939
+0.10
0.04
609
+0.10
0.03
1511
+0.09
0.01
Negative Logits
لينك
-0.53
forChild
-0.49
("]");-0.47
(',');-0.46
={`/-0.46
growled
-0.45
będę
-0.45
(")");-0.44
(",");-0.44
jerked
-0.43
POSITIVE LOGITS
ftu
0.70
laft
0.68
meuble
0.68
fup
0.65
nuage
0.65
cimetière
0.63
pavillon
0.63
tranf
0.63
accessoire
0.63
vœ
0.63
Activations Density 0.178%