INDEX
Explanations
phrases related to military drills and disciplinary actions
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
906
+0.14
0.4%
1959
+0.12
0.4%
1843
+0.11
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1959
+0.14
0.09
1009
+0.12
0.03
2016
+0.11
0.06
Negative Logits
介入
-0.54
They
-0.54
It
-0.52
They
-0.51
抽出
-0.51
about
-0.50
全集
-0.49
孤立
-0.49
It
-0.49
留意
-0.48
POSITIVE LOGITS
megane
1.21
swarovski
1.19
thermomix
1.18
sappi
1.17
broderie
1.16
tricot
1.14
paillettes
1.13
chèvre
1.12
cabrio
1.07
Angleterre
1.05
Activations Density 1.344%