INDEX
Explanations
positive feedback related to the quality and construction of products
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1403
+0.12
0.4%
1013
+0.10
0.3%
736
+0.09
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
2044
+0.12
0.07
736
+0.10
0.06
1157
+0.09
0.05
Negative Logits
ideolog
-0.69
EoL
-0.69
Khach
-0.67
חיצוניים
-0.66
Palembang
-0.64
gubern
-0.60
gallina
-0.60
Michoacán
-0.59
Riau
-0.59
romero
-0.57
POSITIVE LOGITS
unspeak
1.22
shenan
1.17
indescri
1.17
disagre
1.15
indestru
1.14
impra
1.13
tolerably
1.13
apprehen
1.09
sophistic
1.08
felicity
1.08
Activations Density 0.698%