Explanations
phrases related to technological developments and industry analysis
Neuron Alignment
Index   Value   % of L₁
138     +0.10   0.3%
803     +0.09   0.3%
870     +0.08   0.2%
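The page does not define the Neuron Alignment columns. The sketch below shows one plausible reading, under the assumption that Value is the feature direction's component on a given neuron and that % of L₁ is that component's absolute share of the direction's L1 norm. The function name `top_aligned_neurons` and the toy vector are hypothetical and are not taken from this dashboard.

```python
def top_aligned_neurons(direction, k=3):
    """Return (index, value, share_of_l1) for the k largest-magnitude entries.

    Assumption: "% of L1" means abs(value) / sum(abs(direction)).
    """
    l1 = sum(abs(v) for v in direction)
    if l1 == 0:
        return []
    # Rank neuron indices by the magnitude of their component, largest first.
    ranked = sorted(range(len(direction)),
                    key=lambda i: abs(direction[i]),
                    reverse=True)
    return [(i, direction[i], abs(direction[i]) / l1) for i in ranked[:k]]


# Toy direction for illustration only, not data from this page.
toy = [0.5, -0.25, 0.0, 0.25]
rows = top_aligned_neurons(toy, k=2)
for index, value, share in rows:
    print(f"{index:>5}  {value:+.2f}  {share:.1%}")
```

If this reading holds, small shares like those in the table would indicate that the feature is spread across many neurons rather than concentrated on a few.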
Correlated Neurons
Index   P. Corr.   Cos Sim.
870     +0.10      0.04
1038    +0.09      0.03
803     +0.08      0.04
Negative Logits
disreg     -1.09
disagre    -1.04
affor      -0.99
inconce    -0.98
cytoplas   -0.97
strick     -0.94
unspeak    -0.91
impra      -0.91
encomp     -0.90
inev       -0.90
Positive Logits
new         1.04
new         0.96
nuevos      0.81
nueva       0.75
addition    0.75
additions   0.74
nuevas      0.72
newcomers   0.71
NEW         0.71
新的        0.71
Activations Density 0.554%
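The page does not say how the density figure is computed. A minimal sketch, assuming it is the fraction of sampled tokens on which the feature's activation exceeds zero; the function name `activation_density`, the `threshold` parameter, and the toy activations are hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of token positions whose activation is above the threshold.

    Assumption: "density" counts strictly positive activations.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Toy activations for illustration only: the feature fires on 1 of 8 tokens.
toy_acts = [0.0, 0.0, 1.5, 0.0, 0.0, 0.0, 0.0, 0.0]
density = activation_density(toy_acts)
print(f"{density:.3%}")
```

Under that assumption, a density of 0.554% would mean the feature fires on roughly one token in 180.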