INDEX
Explanations
terms related to efficiency, power consumption, and operational states of machines
Neuron Alignment
Index   Value   % of L₁
443     +0.07   0.2%
1980    +0.06   0.2%
1731    +0.06   0.2%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1395    +0.07      0.04
522     +0.06      0.04
1273    +0.06      0.02
Negative Logits
disagre    -0.94
reluct     -0.92
fernando   -0.86
prouve     -0.84
lorenzo    -0.84
Souha      -0.84
écout      -0.83
disgra     -0.83
unspeak    -0.83
shenan     -0.83
Positive Logits
active       0.82
actively     0.76
active       0.66
<bos>        0.64
activos      0.63
proactive    0.63
actively     0.58
aggressive   0.57
dynamic      0.57
lively       0.55
Activations Density 0.629%