INDEX
Explanations
phrases related to military threats or strategic considerations
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
2016
+0.11
0.3%
678
+0.10
0.3%
2034
+0.09
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1081
+0.11
0.04
1705
+0.10
0.04
647
+0.09
0.03
Negative Logits
Ferdin
-0.71
erk
-0.68
traktor
-0.66
bera
-0.63
heti
-0.63
pank
-0.60
saar
-0.60
akut
-0.58
Öster
-0.57
Jä
-0.57
POSITIVE LOGITS
<bos>
0.57
Personensuche
0.56
hvit
0.49
zc
0.47
cherchez
0.47
Bekasi
0.47
voyons
0.47
nmax
0.46
devaient
0.46
découver
0.45
Activations Density 0.187%