INDEX
Explanations
mentions and details about different types of engines
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.16
1.0%
1604
+0.12
0.8%
1034
+0.11
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1604
+0.16
0.02
1437
+0.12
0.02
1974
+0.11
0.02
Negative Logits
<bos>
-3.32
reinstate
-0.74
inaugurate
-0.73
defray
-0.72
mobilize
-0.70
rehabilitate
-0.70
rejoined
-0.70
disbur
-0.70
strode
-0.68
recollect
-0.68
POSITIVE LOGITS
Engine
1.18
engine
1.17
engines
1.10
Engines
1.09
Minang
1.09
Engine
1.07
maroc
1.07
Engines
1.04
ENGINE
1.04
engines
1.04
Activations Density 0.071%