INDEX
Explanations
international political and geographical terms
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
304
+0.11
0.3%
678
+0.11
0.3%
1870
+0.10
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
523
+0.11
0.03
2021
+0.11
0.03
8
+0.10
0.03
Negative Logits
htbp
-0.55
of
-0.54
<bos>
-0.54
so
-0.53
just
-0.52
for
-0.50
in
-0.50
at
-0.50
过期
-0.49
on
-0.49
POSITIVE LOGITS
praktik
1.36
alkoh
1.32
keramik
1.30
kosme
1.30
antik
1.28
Strukt
1.26
konkre
1.24
Simult
1.20
silikon
1.18
mikrofon
1.16
Activations Density 0.179%