Explanations
temporal references and numerical patterns
Neuron Alignment
Index   Value   % of L₁
50      +0.16   0.7%
1343    +0.13   0.6%
1978    +0.08   0.3%
Correlated Neurons
Index   P. Corr.   Cos Sim.
504     +0.16      0.10
88      +0.13      0.09
1343    +0.08      0.08
Negative Logits
<bos>          -2.99
ⓧ              -0.89
/**            -0.69
},[])          -0.68
Autoritní      -0.68
الرياضيه       -0.67
HideFlags      -0.66
intios         -0.65
betweenstory   -0.64
/*             -0.64
Positive Logits
maneu          1.40
unlaw          1.38
affor          1.30
resear         1.25
stockholm      1.24
impra          1.24
accla          1.23
increa         1.22
impractica     1.21
philanth       1.19
Activations Density: 0.243%