Explanations
terms and phrases related to energy consumption and efficiency
Neuron Alignment
Index   Value   % of L₁
269     +0.17   1.0%
156     +0.15   0.9%
219     +0.13   0.8%
Correlated Neurons
Index   P. Corr.   Cos Sim.
269     +0.17      0.03
38      +0.15      0.01
219     +0.13      0.02
Negative Logits
adoles      -1.82
tears       -1.77
ieurs       -1.74
ieur        -1.69
apologize   -1.68
mistrial    -1.68
aughters    -1.66
hers        -1.65
iquit       -1.63
rude        -1.61
Positive Logits
scape         1.80
seek          1.74
reserves      1.74
store         1.71
zone          1.69
core          1.67
corp          1.65
emitted       1.64
éĩı           1.63
consumption   1.63
Activations Density 0.097%
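
The entry éĩı in the Positive Logits list looks like a byte-level BPE rendering of a multi-byte character rather than corrupted text. The page does not say which tokenizer the model uses, so the following is only a sketch under the assumption of a GPT-2 style byte-to-unicode table. The function names are illustrative and not part of any dashboard API.

```python
def bytes_to_unicode() -> dict[int, str]:
    """Rebuild the GPT-2 style byte -> printable-character table (assumed tokenizer)."""
    # Bytes that are already printable map to themselves.
    bs = list(range(0x21, 0x7F)) + list(range(0xA1, 0xAD)) + list(range(0xAE, 0x100))
    cs = bs[:]
    n = 0
    # Every remaining byte is shifted to a code point at 256 and above.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, (chr(c) for c in cs)))


def decode_token(token: str) -> str:
    """Map a displayed token string back to raw bytes, then decode as UTF-8."""
    char_to_byte = {c: b for b, c in bytes_to_unicode().items()}
    raw = bytes(char_to_byte[c] for c in token)
    return raw.decode("utf-8", errors="replace")


# The token from the Positive Logits list, written with escapes to avoid
# any ambiguity about composed versus decomposed characters.
print(decode_token("\u00e9\u0129\u0131"))  # prints 量
```

If the assumption holds, the token is the CJK character 量 (roughly "quantity" or "amount"), which would be in line with the explanation given for this feature.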