INDEX
Explanations
conditional statements using the word "if."
Neuron Alignment
Index   Value   % of L₁
1370    +0.10   0.3%
1482    +0.10   0.3%
1438    +0.09   0.3%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1370    +0.10      0.03
1482    +0.10      0.03
1307    +0.09      0.02
Negative Logits
mybatisplus   -0.60
robus         -0.56
combusti      -0.54
anterie       -0.51
ABASES        -0.50
slidesPer     -0.49
droj          -0.49
senz          -0.49
thello        -0.48
bitat         -0.47
POSITIVE LOGITS
tolerably     0.70
unwarran      0.70
gaily         0.64
vainly        0.63
withal        0.61
imperfectly   0.61
liberality    0.59
unspeak       0.58
unlaw         0.58
caprice       0.58
Activations Density 0.110%