INDEX
Explanations
instances of the word "strike" with varying contexts related to labor disputes or military actions
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
299
+0.07
0.3%
50
+0.07
0.3%
1676
+0.06
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1676
+0.07
0.06
1385
+0.07
0.08
1870
+0.06
0.00
Negative Logits
<bos>
-1.43
public
-0.67
,
-0.66
/**
-0.66
in
-0.65
-0.65
also
-0.65
protected
-0.64
held
-0.64
//
-0.64
POSITIVE LOGITS
maneu
1.90
stockholm
1.77
affor
1.75
increa
1.74
impra
1.65
jorge
1.64
bandung
1.62
scrat
1.59
jaya
1.58
ricardo
1.57
Activations Density 1.228%