INDEX
Explanations
sequences of numbers that are arranged in a specific format
Neuron Alignment
Index    Value    % of L₁
1343     +0.13    0.4%
876      +0.10    0.3%
674      +0.09    0.2%
Correlated Neurons
Index    P. Corr.    Cos Sim.
382      +0.13       0.05
1343     +0.10       0.05
523      +0.09       0.03
Negative Logits
shenan       -2.11
impra        -2.04
accla        -2.02
reluct       -2.02
sophistic    -2.01
indestru     -2.00
pamph        -1.99
philanth     -1.98
disagre      -1.98
unspeak      -1.91
Positive Logits
DRAWINGS          0.70
0                 0.69
PREFERRED         0.64
requerimientos    0.61
cnica             0.60
renovables        0.60
ribune            0.59
fill              0.57
sentimientos      0.56
ടു                0.56
Activations Density 0.158%